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Method for yrowiny stem g^^li^ 



The present invention is related to a mefliod for growing stem cells. 

Stem cells are commonly defined as cells which exist for the lifetime of an organism 
and are able to imdergo symmetric and/or asymmetric divisions, to give rise to fur- 
ther stem cells (for preservation of the stem cell pool) and to more differentiated 
cells with defined life-time (for organ-specific functions). Due to this unique pro- 
perty they are ideal vehicles for somatic gene therapy. They would maintain the 
transgene for the life-time of the tissue and the organism, and would carry the trans- 
gene expression into the differentiated cells. Stem cells may be totipotent (e.g, em- 
bryonic stem cells), pluripotent (e.g. hematopoietic stem cells) or unipotent (e.g. 
keratinocytes, muscle precursor cells). 

Although being the aim of many research projects, it has until now being very diffi- 
cult to grow stem cells, especially lineage-committed stem cells, thereby controlling 
expansion and differentiation of the stem cells, 

The present invention provides a method for growing stem cells comprising flic 
steps of 



• providing stem cells with supporters said supporters being genetically modified 
in order to provide externally regulatable interactions between the supporters and 
the stem cells; 

• applying an external signal for starting or stopping the interactions. 



According to the present invention, stem cells are co-incubated with supporters. 
These supporters are genetically modified to allow a regulatable interaction with the 
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Stem ccUs, These interactions between the supporters and th stem cells are exter- 
nally regalatable, "Externally regulatable" defines that the interaction between the 
supporters and the stem cells is regulated from outside of the supporters. 

Preferably, the interactions are based on the secretion or display of substances. 
These substances, which are secreted or displayed by the supporters control the 
development of the stem cells. The expansion or differentiation of the stem cells is 
indirectly controlled by the regulatable interactions of the supporteis. 

Preferably as external signals may serve the addition or removal of substances, heat, 
light, sound and/or electromagnetic waves. The only requirement is that these exter- 
nal signals are able to regulate the interactions between the supporters and the stem 
cells. 

Preferably, the supporters are cells. They may be stem cells or non-stem cells. In a 
preferred embodiment these supporters are forming a micro-environment. 

The supporting cells can be further transformed with foreign genes to express a gene 
product of interest e.g. a protein of the clotting cascade, insulin, enzymes growth 
factors or the like. 

It is believed that the supporters form a micro-environment thus providing cytokines 
and adhesion molecules as well as direct contact between the stem cells and the 
supporters. Suitable supporters are skin cells, lung cells, bone marrow stroma cells, 
or tissue cells. 

Suitable secreted or displayed substances are cell based growth factors, protein 
growth factors, interleukines. 



In particular, the supporters are genetically modified with a vector comprising a 
gene for the substances, e.g. interleukines, protooncogenes, oncogenes, cell cycle 
control genes, and/or cell based grow& factors and a regulatable e3q)ression system. 
A preferred regulatabie expression system is the regulatable tetracycline expression 
system. 

Preferred vectors for the transformation of tije supporters are the vectors selected 
from the group consisting of pRetn>-tet-o€f-E6/E7, pRetro-tet-off-U19-tsA58, 
pUHD15.1-6-gal-NeoR, pUHD10,3-TGFfl3, pUHD10.3.hlL3, pUHD10.3.hIL6, 
pD12YCVJC-long-CNTF, pD12YCVJC-long-GDNF, pDl2YCVJC-8hort-CNTF 
and pD12YCVJC-short-GDNF. 

Details on these vectors can be found in the exan^les and the figures. 

Cell lines obtainable by genetically modification of cells with fte vectois of the 
present invention form part of the invention. 

The present invention iurtfaer provides a method of curing diseases by gene therapy 
and/or cell therapy which diseases are related to insufficient, lack or disorder of 
stem cells, by administering to patients in need tiiereof, supporters being genetically 
modified in order to provide externally regulatable interactions between the siq>> 
porters and the stem oeUs^ Furthermore, any disease related to insufficient expr^ 
sion or activity of a protein or enzyme can be treated by administering supporters 
and/or stem cells after expansion ex vivo. 

Fig, 1 shows the appearance of hIL-3, depending on the addition/removal of doj^- 
cycline in vivo. 

Fig. 2 shows the shows die appearance of hIL-6 d^nding on the addition/removal 
of doxycycKne in vivo. 
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Fig. 3 shows the cloning of growth factor genes. 
Fig. 4 to 12 show the sequences of vectors. 

The method of the present invention is fluther explained by the following exan^le: 
Example 

hIL6 containing transgcnic-keratlnocytlc stem cells support tbe growth of 
(sister) keratlnocytlc stem cells in culture: In these sets of experiments. ELISA 
assays were performed with supemates obtained from one hIL6 of construct tcto- 
hIL6 containing kcratinocytic cell clone derived from a CMV-tTA x teto-SV40 T 
antigen transgenic mouse, either cultured alone, or with doxycycline included in the 
culture for 0-12h, 25-50h. This experiment is to test whether the secretion of cytoki- 
nes affects by doxycycline in culture. 

hIL6 promotes the growth of CMV-tTA x teto-SV40Tag transgenic keratino cytic 
stem cell line drastically by increasing cell numbers. At the presence of hIL6, the 
growth arrest at Gl compartment is abolished, and cells continue to grow in the 
presence of doxycycline. 

The level of hIL6 in the supemates was slightly inhibited at time point of I2h, and 
increase again at 50h, up to the level of the control cells, i.e., without doxycycline. 
Thus, hIL6-containing clone continued to secrete hIL6 despite of flie fact that doxy- 
cycline was included in the culture. 

The data are interpreted as follow: Engineered stem cells support the growfli of 
sister stem cells (internally or externally) in vitro by the combination of the follow- 
ing two mechanisms: 




(1) The hJL6 «igineered keratinooytic stem cells secrete hlL6 into supemate, target 
and promote the growth of sister keratinooytic stem cells - an extemal/heierocrine 
mechanism. 

(2) The gene product of hIL6 engineered keratinooytic stem cells, acts intra- 
cytoplasmically, and promotes the growth of itself - an intemal/autocrine mecha- 
nism. 

1. In both events, hIL6 is able to raaiatain skin in the keratinoytic stem cells com- 
partment, upon the withdraw of doxycycline. 

Cytoldne^ataining kerattnocytic stem ceUs and trachefti epithelial stem cells 
secrete cytoldaes Imown to support the growth of lienuitopoitic stem ceils 
(HSC): 

Using hIL3, hIL6, flk2/flt3Ligand to support the growth of HSC for several weeks 
in culture, and in comparing to that of stroma cell lines in supporting HSC, has been 
performed. In the literature there are many published data showing that hIL3. hIL6, 
flk2/flt3Ligand are essential to support the growth of HSC. Hiese data show that 
these cytokines are essential in maintaining HSC in culture, and in increasing the 
transduction efliciency of retroviruses into HSC in the two chamber culture system 
where the experiments were performed and described in the literature. 

Keratinoctic stem cell lines and tracheal epithelial stem cell lines were established 
from CMV-tTA x tetoCMVm-SV40Tag double transgenic mice (from H. Bujaid 
and S. Efrat). 

Stem cells are supported by cytoldiie-eoiitalniiig transgenic stem cells: Id vivo 
using Immnnolncompetent mice (nn/nu mice): It is for the purpose of somatic 
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delivery of growth factors essential for maintenance of human hematopoietic stem 
cells (HSC) in recipient hosts, The cytokines constructed shown to be fiinctional for 
HSC are hIL3, hIL6, and flk2/flt3Ligand, and thus are used further for in vivo ex- 
periments (below). 

Teto-hIL6, teto-hIL3 containing transgenic kcratinocytic and tracheal epithelial 
stem cells derived from CMV-tTA x teto-CNfVm-SV40Tag double transgenic mice 
were pre-cultured on DED (denuded dermis from human cosmetic operation) and 
implanted subcutaneously (flip-in) into immunoincompetent mice (nude mice). 

Blood samples from such nude mice were collected from a tail vein of mice periodi- 
cally. Sera were separated from blood clots. ELISA tests were perforaaed on serum 
samples collected. After cytokines were demonstrated to appear in blood, such nude 
mice ingested doxycycline (Img/ml) included in the drinking water and blood col- 
lected at the time points indicated. As indicated in figures 1 (hIL3) and 2 (hIL-6), 
hIL3 and hIL6 are shown to appear in the blood reaching a significant amount (14.7 
pg/ml for hlL3, 15,9 pg/ml for hIL6), and they were decreased when doxycycline 
was included the drinking water. Upon removal of doxycycline, hIL3 and hIL6 were 
shown to increase to higher levels again (41.2 pg/ml for hIL3, and 14.5pg/nil for 
hIL6), Upon reingestion of doxycycline, the levels of cytokines were shown to de- 
crease to zero. 

The mice survive over the 5-6months of experiment without any sign of illness due 
to the implantation of engineered mouse stem cells delivering human cytokines. The 
pattern can be cyclic. It is predicted that the protocol will work similarly in the 
SCID-NOD mice. Thus, in nude mice, we show that the secretion of cytokines such 
as hlU and hlL6 into the blood stream is regulated by doxycycline in the drinking 
water. 
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In summary, the above protocol of somatic engineering of immuno-incompetcnt 
mice with regulatable debvery of growth factors has been tested and shown to be 
deliverable to high titers in nu/nu mice. The growth of transgenic keratinocytes and 
lung epithelial cells, and the delivery of cytokines are shown to be subjected to the 
regulation of doxycycline (in culture of some cells, such as HETA cells but not 
other cells, such as a hlW-containing keratinocytic cell line» when doxycycline is 
included in the medium); and in vivo when inchided in the drinking water. 

The principle of this protocol can also apply to the support of the growth of stem 
cells of any kind, such as neural and glial stem cells, in immuno-incompetcnt mice, 
as a novel diagnostic tool for evaluating the preclinical and clinical protocols. 

EstabUshmetit and commercialization of SCIO-NOD-tau systems as diagnostics 
for growth and evaluation of self-renewal property of human neuronal and 
glial stem cdls, clinical protocol and for drug targeting; 

The keratinocytic stem cell line and tracheal epithelial stem cell line derived ftom 
CMV-trAxtetoCMVm-SV40Tag double transgenic mice are used in this type of 
experiment. These cell lines are inserted with cytokine constructs for fee somatic 
delivery of neurotropic factors essential for the survival and maintenance of human 
adult brain stem cells in recipient hosts. The cytokines constructed are pD12YCV- 
JC-diiven GDNF and CNTF. The transgenic tracheal epithelial stem cells are pre- 
cultured on DED (dead de-epidermized dermis) and implanted subcutaneously (flip- 
in) (in the head region) into SCflD-NOD mice. The growth of transgenic hmg kerati- 
nocytic stem cells and epithelial cells, and die delivery of cytokines have been 
shown to be subjected to the regulation of doxycycline in culture and in vivo when 
included in the drinking water. The mice survivie over the months of es^eriment 
without any sign of ilUiess due to the implantation of engineered mouse cells deliv- 
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ering human cytokines. The principle of the protocol will work similarly in the 
SCID-NOD mice for human neurotrophic fiwtors. 



Protocol for construcUon of pDllJCVPLong-CNTF plasmld 

Similar strategy and construction protocols held for pD12JCVPLong-GDNF, 
pD12JCVPshort-CNTF,pDl2JCVPshort-GDNF. 

1. pD12JCVPLcng vector (from E. Beck and J. Henson) was linearized upon Nail 
restriction enzyme. 

2. The sticky ends of the vector were filled using Kienow fragmenta of E. coli po- 
lymerase I. 

3. Digestion of the linearized pDl2JCVPLong vector with restriction enzyme Xhol. 

4. After digestion, the DNA sample was subjected to gel electrophoresis in 0.8% 
preparative agarose gel to obtain ca. 6.3 kb DNA fragment (pDl2JCVPLong x 
NsiyXhoI). 

5. pBS-hCNTF-079 vector (from E. Beck) was linearized with restriction enzyme 
Notl. 

6. The termini of the Imearized pBS'hCNTF-079 vector was filled with Kienow 
fragment of E. coH DNA polymerase I in order to obtain the blunt««nd. 

7. The linearized and blunt-ended (pBS-hCNTF-079 x Notl) was digested with 
Sahl. 
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8. After digestion, the DNA sample was subjected to gel electrophoresis in 0.8% 
preparative agarose gel and the 2469 bp DNA fragment containing CNTF gene 
was isolated. 

9. The blunt- and sticky ended (CNTF x Notl/Sahl) fragment (from step 8) was 



Nsil/Xhol) (from step 4) vector. 

lO.VXLl-blue competent bacteria E. coli was transformed wifli DNA (from step 9), 
and ampicillin resistant clones were selected, and characterized to be correct. 

ProtooiJ for coaseruction of pRetro^FF-£^7 plasmld: 

1. pLXSNE6E7 vector (from D. Galloway) was linearized upon EcoRI restriction 
enzyme digestion. 

2. The sticky ends of the vector was filled using Klenow fragments of E.colt DNA 
polymerase L 

3. The termini of the linearized pLX$N£6E7 was ligated with synthetic adaptor 
(Xhol-Notl-Bgin) purchased from Roth, Karlsruhe. 

4. The newly adapted-[pLXSNE6E7 x Notl/BamHI] (stqp 3) was digested mtb. NotI 
andBamHI. 

5. After digestion, the DNA sample was subjeted to gel eIectrophoi«sis in 1% 
preparative agarose gel to obtain ca. 830 bp fragment cf[ESE7 x NotUBamHI]. 



Hgated with complementary blunt- and sticky-ended (pD12JCVPLong x 



6. pRetro-OFF vector was digested with NotI and BamHI. 



7. The fragment of [E6E7 x Notl/BamHl] was then inserted into the [pRetto-OFF 
vector X Notl/BamHI] (step 6). 

8. XL 1 'blue competent bacteria E.coli were transformoi with the coaslruct from 
step 7, Ampicillin resistant clones were selected and characterized to be coirect. 

ProtDCol-2 for coDstructlou of pRetro-OFF-U19t8A58 plasmid: 

1. pZipNEOSV(x) vector (from P. Jat) was digested with BamHI restriction en- 
zyme. 

2. After digestion, the DNA san^Ie was electrophoresed in 0.8 % preparative aga- 
rose gel to obtain ca 2 .6 kb DNA fragment (Ul9tsA5 8 x BamHI). 

3. pRetro-OFF vector was linearized with restriction ens^roe BamHI. 

4. The terminal of the linearized pRetro-OFF vector was dephosphoried vrith 
Shrimp Alkaline Phophatase (USB) from Amersham. 

5. The fragment of (U19tsA58 x BamHI) (from step 2) was then inserted into the 
(pRetro-OFF vector x BamHI) (from step 4). 

6. The XLl-blue competent bacteria E. coli was transfonned with DNA {from step 
5). and ampicillin resistant clones were selected, and characterized to be conect. 

Protocol for the construction of pUHD-tranaactivator vectors: 

A. Transactivator, pUHD15.1-pCMV-tTA-B-gal-neomycin plasmid: 

1. pUHDlSa (from H. Bujard) was linearized using BamHI restriction enzyme. 
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2. 5' -end was dephosphorized using phosphatase, and the DNA sample was sub- 
jected to gel electrohoresis in 1% preparative agarose gel to obtain ca. 72SS bp 
fragment of (pUHDlS.lBamHI). 

3. IRES^Bgeo fragment which contains lacZn-neo (Ca 3050 bp)) was obtained from 
another plasmid (ptetotsASSIRESfigeo) (from H. Schoeler) using BamHI restric- 
tion enzyme digestion. 

4. Aiter digestion^ the DNA sample was subjected to gel electrohoresis in 1% prepa- 
rative agarose gel to obtain ca. 3050 bp fragment of (IRES-BgeoBainHI). 

5. The fragement of (ERES-BgeoBamHT) (step 4) was then inserted into the 
(pUHDlS.lBamHI) (step 2)B. 

6. XLNblue competent bacteria £. coli were transformed with the construct from 
step 5. AmpiciUin resistant clones were selected and characterized to be correct 

Protocol for the coBstruction of pUHD-responder vectors: 

B. Responder pUHDlO.3 cytokine plasmids: 

1. The multiple cloning site (MCS) of responder pUHDlO.3 (from H. Bujard) was 
linearized using EcoRI and SacII (for hIL6), or EcoRI and BamHI (for hIL3), or 
EcoRI and Xbal (for TGFfi3) restriction enzymes. 

2« After digestion* ^e individual DNA sample was subjected to gel electrohoresis in 
1% preparative agarose gel to obtain ca. 3150 bp fragment of DNA. 

3. Fragments of cDNA coding for hIL6 (EcoRI-SacII), hIL3 (EcoRI-BamHI), 
TGFfi3 (EcoRI-Xbal) were obtained from die original supplier (A. Bemad, Ge- 
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netic Institute, ATCC), and individual restriction enzyme digested as indicated in 
the original publications. 

4. After digestion, the DNA sample was subjected to gel electrohoresis in 1% prepa- 
rative agarose gel to obtain ca. 600 bp fragment of hIL6 (EcoRl-SacII), ca, 475 
bp fragment of hIL3 (EcoRI-BamHI). and ca. 1233 bp fragment of TGFB3 
(EcoRI-Xbai). 

5. The fragement coding for the respective cytokine gene (step 4) was then inserted 
into the responder pUHDlO.3 EcoRKSacII (for hIL6), or EcoRI-BamHI (for 
hlU), or EcoRI-Xbal (for TGFB3) (step 2). 



6. XLl-blue competent bacteria E, coU were transformed with the oonstnict from 
step 5. Ampicillin resistant clones were selected and characterized to be correct. 
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2. 
3. 

4, 

5. 
6. 
7. 



A method for growing stem cells comprising the steps of 

- providing stem cells with supporters said supporters being genetically modi- 
fied in order to provide externally regulatable interactions between Uie sup- 
porters and the stem cells; 

- applying an external signal for starting or stopping the interactions. 

The method of claim 1 wherein the interactions are based on secretion or dis- 
play of substances. 

The method of any of the claims 1 or 2 wherein the supporters are modified for 
the secretion or display of substances under control of a promoter. 

The method of any of claims 1 to 4 wherein the external signal is the addition 
or removal of substances, heat, light, sound and/or electromagnetic waves. 

The method of any of claims 1 to 4, wherein the supporters are non-stem cells. 

The method of any of claims 1 to 4, wherein the suppcnteis are stem cells. 

The method of any one of claims I to 6, wherein the supporters are forming a 
micro-environment 
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8- The method f any one of claims I to 7, wherein the supporters are skin cells, 
lung cells, bone marrow stroma cells or tissue cells. 

9. The method of any one of claims 1 to 8, wherein the supporters are secreting 
or displaying cell based growth factors, protein growth factors and/or inter- 
leukines. 

10- The method of any one of claims 1 to 8, wherein the supporters are trans- 
formed by a vector comprising a gene for interleukines, protooncogcnes, on- 
cogenes, cell cycle control genes, and/or cell based growth factors as well as a 
rcgulatable expression system, such as a tetracycline legulatable expression 
system. 

II. A vector selected from the group consisting of pRetro-tet-off-E6/E7, pRetro- 
tet-ofr-Ul5^tsA58, pUHDlS.l-fl-gal-NeoR, pUHD103-TGFfl3, pUHD103- 
hIL3. pUHD10.3-hIL6, pD12YCVJC4ong^CNTF, pD12YCVJC-long-GDNF, 
pD12YCVJC-short-CNTF and pD12YCVJC.short.GDNF, 

12- A supporter cell being genetically modified in order to provide a regulatable 
secretion and/or a display of substances of the supporters. 

13. A method of curing diseases by gene therapy and/or cell therapy which dis- 
eases are related to msufficient and/or lack and/or disorders of stem cells, by 
administering to patients in need thereof, supporters being genetically modi- 
fied in order to provide externally regulatable interaction between supporter 
cells and stem cells. 
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14. Cell lines obtainable by transforming cells with the vector according to claiml 1 . 
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A method for growing stem cells comprising the steps of 



providing stem cells with supporters said supporters being graetically modi- 
fied in order to provide externally regulatable interactions between the sup- 
porters and the stem cells; 



applying an external signal for starting or stopping tiie interactions. 
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Cloning of growth factor genes 
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QCTAQCGATX TAGOttaACIiC TATROAATAO ATCtcg cam nGTCACCCCT 
AGM3TCQM3C TGTOACOGTC CWACAAMU AXTGCMICTO OOTTATCTTC 
TTCCTt3ATGG CAaOQQTTAC AaaTAMSOGG CTCCCAAGTC CCAAACTTGA 
GGGTCCttTAA ACTCTOTQJW: AOTOOCAATC ACTTTGCCTT TCTTTCTACA 
(aOGGIGhATT CX3GCTTTCAC AOAOCATTCA CCOCTGACCC CTCACCOTCa 
GQAOCTCTGT AOCCOCTCTA TCTQ0CTJM3C AAOGAAGATX CJGTTCaGRCC 
TTQACIGCTC TTACQOAATC CTATOTAAST TGCCTATTTT QCTGrEATCT 
OTTTTCCCTT CATCTTTTTT GMTCCAOCAA CTTACCATCA CGCATCAGCT 
CCATTACC3VA TTGWGaUlAGC TCTAATCAIA TA6TCATTCA TATAaOTTAT 
TTTOACAWSGG CCCTTCCCTT GAGQMACCC ATOTGACTTT ATTTTCTTCC 
TCTOGGCTGT TTAGGM3ATO AAOTTftCTTG AATQAOAAAA TATAXATOGA 
OTTCTAGAAA OGATTGGTTT ATAtOTCTTO GAOQCTA-m CAAAATTTAT 
TTGOCCATAt ATTCTGAATA CTACCTAOAA CAQATTASCC ATOQOCCCTH 
TGOOTTMrTTC ATAAOCCATT GTTCTQWWT TTTITABCTT TOTAAATQAA 
MXSrrtKSQQ QATAOGAAGA GrNCTATOAA CGTGOQASOA ATTTGTAAAT 
CCTACCAATT TNTJICXAXAT ASCATtAOCC CCCACCTTTT AMIATTCTGC 
ATCAAAA8TA AGATTOTGTC TAAAOAOAAA GGTNABCIAT CAAAAGGACT 
CCKATAMIAT TCaiTTQGAAA CTTOTOGAAH TGTCAAATTT MTTTQAGCTA 
ATTNTTGGAO TTCCAAAHTT TOTCTIMTMA CAOIHAAaiaG QGAMCCCCAT 
TCAHATTTKC CCCCCTHHNG AKAATOCrrO GQOGAAAAAA CCINCCAACC 
CCTnOTOGG AWSaUUSTTTT TTTAAIMTPT XAA08CTMGN NQAAACHOaN 
TTITAATTTT TTGaaHCHAN COCCTHTCCC CQGTACCAGG AAAATCAOQA 

ccnmmTC ooomiaiiGCN cgscacmgqgg ogmaaaamqo GAAATrrcarr 

CAKAAAAAAT CTTTTCOGnn nimngtsaag catcngggee tgaacaagaa 
catcaacccg gsctetscgg atggsfttgec agtggcAage actgatcagt 
99«9t9agct aaocgaggea gagogactcc aagagaaccb tcaagcttat 



Fig- 4 
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1301 cgtaccttcc atgttttgtt ggccaggctc ttagaagacc agcaggtgca 

1351 tfcttacccca aecgaaggtg acttccatca agctatacat aeccttcttc 

1401 tccaagtcgc tgcctttgca taccagatag aggagttaat gatactcctg 

1451 gaatacaag tcccccgcaa tgaggctgat gggatgcctia ttauitgttgg 

1501 agatggtggt ctcttcgaga agaagctgtg gggcctaaag gtgctgcagg 

1551 agctttcaca gtggacagta aggtccatcc atgaccttcg cttcatttct 

1601 tctcatcaga ctgggatccc agcacgtggg agceatcata ctgctaacaa 

1651 caagaaaatg eagniuumgc ggecTGCGCC OTcmcccXv ACGTTAAAGG 

1701 G\T6AAACCA CMGACTTAC CITOGCTCOG AAGTAAAAOG ACJUUICACAC 

1751 ACM3TTTTGC CCGTTTTCAT QAOAAATGGG ACGTCTGOQC ACGJUUCGOS 

1801 CCGTCGCTTG AGGAGGACTT C3TACAAACAC QAXCXATOCA GGTTTCCCCA 

1651 ACTQACACAA ACOCTGkSVAC TIGAAACTCC GCCTOQTCrr TCCAGGTCTA 

1901 QAOOGCTrAAC ATTTTG1!ACT CrrGTTTQACT CCAOOCTCOA TCCACTAGCX3 

1951 AGTGTTA6TA OOGGTACTGC TGTCTCOXA0 CGGAaCATGT TGGCCGTGOG 

2001 AACACCXCCT TOQTAACAAG GACCCACOQG GCCOAAAGCC ATGTCGTAAC 

20$1 CSGACCCAACA TGTGTGGAAC CCCAOCACGG CAGCTTTACT GTGAAACCCA 

2101 CTTCAA0GTG ACATTGA7AC TGOtACTCAA ACACTG6TQA CAOQCTAAGG 

2151 ATGCCCrrCA 66TACCCCQA GGTAACAAGC GACACTCCXKS ATCTGAGAAG 

2201 QQOACTGGGA CTTCTTTAAA 6TCCCCAGTT TAAAAAfiCTT CTAOGCCTGA 

2251 ATAGGrGACC GGAOOCXedC ACCTTTCCTT T7ATAACCAC TGAACAGATG 

2301 GAAGAOGCCA AAAACATAAA QAAAGOCCOa QCC3CCATTCT ATCCTCTAOA 

2351 GGATGQAACC GC7GGAQAGC AACTGCATAA OOCXATGAAG AGATACQCCC 

2401 TGGTTCarOG AACAATTOCr TTTACAGATQ CACATATOGA OGTGAACATC 

2451 ACGTACGCGO AATACTTCGA AATQTMXOtT OGGTTOGCAG AAGCTATGAA 

2501 ACX3ATATGGG CTQAATACAA ATCACAGAAT CGT06TAT0C AOTQAAAACT 

2551 CTCTTCAATT CTTTATOCCG GTGTIOaOCXI CX2TTATTTAT CQGAGTrOCA 

2601 GTTGCGCCCX3 CGAAGQAGAT n'ATAATGAA OGTOAATTGC TCAACAGTAT 

2651 GAACATTTCG CAGCCTACCXS TACTSTTTOT TTCCAAAAA6 OGOTTOCAAA 
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2701 


AAATTTTGAA 


CGTGGAAAAA AAATTACCAA TAATCCAiGAA 


AATTATTATC 


2751 


ATGQIkrrCTA 


AAACGOATTA CCMSGQATTr CAGTCGMGT 


ACACGTTCGT 


2601 


CACATCTCAT 


CTACCTCCC6 GTTTTAATOA ATACGATTTT 


GTACCAGAGT 


2851 


CCTTTGATCG 


TGACAAAACA ATTCKSUrTGA TAATGAATTC 


CTCTGGATCT 


2901 


ACTQG»TTAC 


CTAAGGOTGT QdCCCTTCCG CATAQAACTG 


CCTGCGTCAG 


2951 


ATTCTOOCAT 


GCCAGAGATC CTATTTrtOd CAATCAAATC 


ATTCCXiGATA 


3001 


CTGOGATTTT 


AAGTGTTOTT CCATTCCATC ACOGTTTTGG 


AATGTTTACT 


3051 


ACACrOGGAT 


ATTTGATATG TOOATTTCOA OTCGTCTTAA 


TGTATAGATT 


3101 


TGMGAAGAG 


COIGTTTTTAC QATCCCTTCA GGATTACAAA 


ATTCAAAGTG 


3151 


CGXTGCTAGT 


ACCAACCCTA TTTrCATTCT TOGCCAAAAG 


CACTCTQATT 


3201 


GACAAATACG 


ATTTATCTAA TTTACAOOAA ATTGCTTCTG 


GGGGCGCACC 


3251 


TCTTTCGAAA 


GAAGTCGGGG AAOOUtfriHiC AAAACX3CTTC 


CATCTTCCAG 


33 01 


GQATAOQACA 


AGGATATGOG CTCACTGAGA CTACATCAGC 


TATTCTOATT 


3351 


ACACCCOATSa 


GGGATGATAA ACOGOGCGCG GTCGGTAAAG 


ttchtccatt 


3401 


TTTTGAAGOG 


AAOGTTGTGG ATCTGOAXAC COGGAAAACG 


CTGGGCGTTA 


3451 


ATCAaAOAOG 


CGAATTATQT GTCAGMBGAC CTATGATTAT 


GrrCCGGTTAT 


3501 


OTAAACAATC 


CGOAASCGAC CAACOCCITG ATTGACAAGG 


ATGGATGGCT 


35S1 


ACATrcTOQA 


GACATAOCTT ACTSaOAOGA AGACGAACAC 


TTCTTCATAG 


3601 




GAAGTCTTTA ATTAAATACA AAGGATATCA 


GGtGGCCCCC 


3£S1 


GCTGAATTOG 


AATCGATATT GtTACAACAC CCCAACATCT 


TCGACGCGGG 


3701 


CGTC3GCAGC5T 


CTTCCOGAOG ATGACGCCGO TOAACTTCCC 


GCOGCCGTTG 


37S1 




GGAOGGAAAG AOGATGA)CQO AAAAAOAGAT 


GGTGGATTAC 


TTGTTTTGGA 




3801 


GTCGCCAGTC 


AAGTAACAAC OGCGAAAAAG rCOCSOOGAG 


GAGTTGTGTT 


3851 


TGTGGACOAA 


GTACCGAAAG GTCTTACOSG AAAACTCGAC 


GCAAGAAAAA 


3901 




CCTCATAAAG GCCAAGAAGG GCGGAAAGTC 


CAAATTGTAA 


3951 


AAT6TAACTO 


TATTCAGOOA TQACGAAATT CTTAGCTATT 


GTAATGACrC 


4001 


TAGAGGATCr 


TTGTGAAOGA ACCTTACTTC TGTQGTGTGA 


CATAATTQGA 


4051 


CAAACTACCT 


ACAQAGATTT AAAOCTCTAA GGTAAATATA 


AAATTTTTAA 
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4101 GTGTATAAIO TGTTAAHCTA CTOATTCTAA TTGTTTGTGT ATTTTAGATT 
41S1 CCAACCTATG GAACTGATQA ATGOGAOCAQ TtSGTGGAATG CCTTTAATCA 
4201 OGAAAACCTG TTTTGCTCAG AAQAAATGCC ATCTAGTGAT GATOAGQCTA 
4251 CMCTGACTC TCAACATTCT ACPCCTCCAA AAAAOAAOAQ AJVAGGTAGAA 
4301 GACCCCAAGG ACTTTCCTTC AGAATTGCTA AOTrTTTTGA GTCATCCTGT 
4351 GTTTAQTAAT AGAACtCTTG CTTGCTTTGC TATTTACACC AC3UVA5GAAA 
4401 AAGCTOCACX GCTATACAAG AAAATTAtGG AAAAATATTC TGTAACCTTT 
4451 ATAAGTAOGC ATAACAGCTA TAATCATAAC ATACTGTTTT TTCTTACTCC 
4501 ACACAGQCAT AGAGTCTCM CTATTAATAA CtATGCTCAA AAATTGTGTA 
435 1 CCTTTAGCTT 7TTAATTTGT AAAGGOOTTA ATAAGGMITA TTTQATGtAT 
4601 AGTOCCTTGA CTAGAGATCA TAATCAGCCA lACCACATTT OTAOAGSOTTT 
4651 TACPrOCTTT AAAAAACCXC CCACACCTCC CCCTGAACCT OAAACATAAA 
4701 AT3AATGCAA TTGrrGTTGT TAACrTGTTT ATTOCAQCTT ATAATGGTTA 
4751 CAAATAAAGC AATAC3CATCA CAAAT-TTCAC AAICEAAAOCA TlTTTTTCAC 
4S01 TOCATTCTAG TTGTQC5TTTG TCCAAACtCA TCAATOTATC TtATCATGTC 
4851 TOOATCCCCG GaTCCCTATA GTQAGTCGTA rCAGCnOGC GTAATCATGG 
4901 TCATAGCTGT TTCCTGTQTG AAATTGTTAT CCGCTCACAA TTCCACACAA 
4952 CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGaTGCC 1AATGA0TGA 
SOOl GCTAACrCAC ATTAATIGOG TTGCOCTCAC TGOCCGCtTT CCAGTOGOGA 
5051 AACCTGTCGT GCCAGCIGCA TTAATQAATC GGOCAAOGCG CGGGGASAOG 
5101 COGTTTGCGT ATTGGGCGCT CTTCOOCTTC CTC3GCTCACT OACTCGCTGC 
51S1 GCTCXWrCGT TOOOCTGOOG CXJAGCGGTAT CAGCTCACTC AAASGCGGTA 
5201 ATAOGGTTAT CXACASAATC AG6GGATAAC GCAOGAAAGA AGATGTGAOC 
5251 AAAAGGCX^G CAAAAGGCCA OOAACOGTAA AAAGQCOSOG ncCTGGCGT 
5301 TTTTCCATAG GCTCCGCCXTC CCTOAOGAGC ATCACAAAAA rOGACGCTCA 
5351 AGTCA8AG6T GGCGAAACCC GACAOGACTA 7AAAGATACC AGGCGTTTCC 
5401 CCCTOGAAGC TCCCTCX5TGC GCTCTCCTGT TCOGACCCTO COGCTTACCG 
54S1 GATACCIOTC COCCnTCTC CCTTCOGSAA GCOTGOCXXTT TTCTCAATGC 
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5501 TCACGCTGTA GGTATCTCAQ TTCGOTOTAG GTCOTTCGCr 

S551 CT6TGT0CAC GAACCCCCCG TTCJIOCCOGA COOCTGOSCC 

5601 ACTATCGTCT TOAGTCCAAC CCQCTAJUSAC ACGACTTATC 

S6S1 OCAOCCACTG GTAACAOOI^T TAGCJkGAOCSO A£3GTATGTAS 

5701 AfiAOTTCTTG AAGTOGTOGC CTAJVCTACGG CTACACTAOA 

5751 TTGGTATCTG OGCTCTGCTO AASCCAOTTA CCTTCaOAAA 

5801 AGCTCPTOAT CCGGCAAACA AACCACCGCT GQTAGOGGTG 

5651 TTOCAAQCAG CAGATTACGC GCAOAAAAAA AGGATCTCAA 

5901 TGATCTTTTC TAOGGGOTCT QA00CTCUU3X OGAACGAAAA 

5951 GGGtATTTTOG TCATGAflATT ATCAAAAAdG ATCTTCACCT 

6001 AAATTAAAAA TGAAOTTTTA AATCS^TCTA AAGTATATAT 

6051 QQTCraACAG WACeAATGC TTAATCACTC AGGCACCTAT 

6101 MTCTATTTC GTTCATCCAT AGnWCCTOA CTCCCOOTOG 

6151 TACOATACOG GftOOGCrTAC CATCTGOCCC CAGTGCTGCA 

6201 GAGACCCAOG CTCACCGOCT CCAGATTTAT CAGCAATAAA 

5251 OGAAGGGCGO AGCGCAGAAQ TOGTCCTGCA ACTTTATCCG 

6301 GTCTATTAAT TGrUGCCGGG AAGCTAOACT AABTAGTTOG 

63S1 GTTTGOGCAA CGTWrTOCC ATTGCEACAG GCATCQTGOT 

6401 TCGTTTCWTA TOGCTTCATT CAOCTCCOGT TCCCAACOAT 

6451 TACATQATCC CCCATGTTGT QCAAAAAAGC 00TTA6CTCC 

6501 COATCSGJWCrr CAGMGTAAG TTQOCCGCAG TOTTATCACT 

6551 GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCOGTAA 

6601 TGTGACTOGr OAOXACTCAA CCAAOTCATT CTOAGAATAG 

6651 C2ACCX»GTTC CTCTTGCCOG GCOTCAATAC OQCSATAATAC 

6701 AOCAGAACTC TAAAAGTGCT CATCATTOOA AAAOOTTCTT 

6751 ACTCTGAAGG ATCTTACCGC TGTTOAOATC C*GXTCGATG 

6001 OTOCACCCSUl CT(»rCTTCA GCATCTTTTA CTTTCAOCAG 

6651 TQAGCAAAAA CAGGAAQ8CA AAATOCCOCA AAAMGGGAA 



CCAAGCTGGG 
TTATCCGGTA 
GCCACTGGCA 
GCGOTGCTAC 
AGGACAQTAT 
AAOAOTTGGT 
CnrrTTTTTQT 
GAAGATCCTT 
CTCACGTTAA 
ASATCCTTTT 
GAOTAAACTT 
CTCAGCGATC 
TGTAOATAAC 
ATGATACCOC 
CCAGCCAOCC 
CCTCCATCCA 
CCAGTTAATA 
GTCAOQCTCG 
CAAGGCX3A9T 
TTCGGTCCJtC 
CATGGTTATG 

GATOCcnrrrc 

TGTATOOOGC 
CGCQCCACAT 
CGGOGOQAAA 
TAACCCACrC 
CGTTTCTGG6 
TAASGOGGAC 
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6901 ACGGAAATOT TGAATACTCA TACTCTTCXTT TTTTCAATAT TATTGAAOCA 
69S1 TTTATCAfiGG TTATTQTCTC ATGAGOQQAT ACATATTTGA ATOTATTTAG 
7001 AAAAATAAAC AAATAGGGOT TCCGCX^CACA TTTCCCCOAA AAGTGCCACC 
70S1 TGACQTCTAA GAAACCATTA TTATCATQAC ATTAACCTAT AAAAATAG6C 
7101 GTATCACGAG GCCCTTTCGT CTCQCX5CGTT TCG6TGATGA OOGTGAAAAC 
7151 CTCTGACACA TGCAGCTCCC G<»GACGQTC ACAGCTTCTC TOXAMSCQOA 
7201 TGCCGGGAGC AGACAAGCCC GTCAfiGGCGC GTCAGOGQGT GTTGGCGGGT 
7251 GTCGOGGCTG GCTTAACTAT GOGGCATCAG AGCAGATTGT ACTOAOAGTC 

7301 CACCATATGC GGTGTGAAAT ACCGCACAGA TGOGTAAGGA GAAAATACXX3 

7351 CATCAGGCGC CATTCGCCAT TCAGGCTGOG CAACTQTTQQ GAAGGGCGAT 

7401 CGGTGCGOGC CTCTTCGCTA TTACGCCAGC TOGCQAAAGG GGGATGTGCT 

7451 OCAAGGCOAT TAAGTTQGGT AACGCCAGGG TTTTCCCAGT CACGACOTIG 

7501 TAAAACGACX3 GCCAGTGAAT TTCOACCTGC AGTCGACAOA AGCCTTAOGT 

7551 GACAGCTQQC GAAGAACCAT OGCXJ^GCTGG TGACAAGCCA AAACAGCTCT 

7601 GGCTCS3CAAA ACATGTTCXX: TTGGCTGCTT TOCACTTCCC CTTGlGCnT 

7651 GTTTACTTGT GTCAGCTGGT TGGCTCCCTA GGTATGAGCT CATGCXTOGC 

7701 IGGCAGCCAT CCAGrTTTAO CCAGCTCTGC TTTGTTTACT TGTGTCAGCT 

7751 GGITGGCTCC CTAGGTATGIA GCTCATOCTT GGCTGGCAGC CATCCAGTTT 

7801 TAGCCACCTC CTOCCTACCT TCCCOTTTT TTAXATATAC AGGAGGCOGA 

78 51 GGCCGCCTCC GCCTCCAAGC TTACTGAGAA GTAGTAAGG6 OGTGGAaOCT 

7901 TTTTAGGAGG CCAGGGAAAT TCCCTTO riT ITCCCCTTTT TOCAOTAAlT 

79S1 TTTTGCTGCA AAAAGCTAA 
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1 GCTAGCGATT tAGGTCjACAC TATAGAATAG ATCCCCATGA AGTTATGCGA 

51 TG7CGTGGCT GTCTGCCTGG TGCTGCTCCA CACCGCGTCC GCCTTCCCGC 

101 TGCCCGCCGG TAAGAGGCCT CCCGAGGCGC CCGCCGAAGA CCGCTCCCTC 

ibl GGCCGCCGCC GCGCGCCCTT CGCGCTGAGC AGTGACTCAA ATATGCCAGA 

201 GGATTATCCT GATCAGTTCG ATGATGTCAT GGATTTTATT CAAGCCACCA 

251 TTAAAAGACT GAAAAGGTCA CCAGATAAAC AAATGGCAGT GCTTCCTAGA 

301 agagagcgga atcggcaggc tgcagctgcc aacccagaga attccagagg 

•Vn AAAAGGTCGG AGAGGCCAGA GGGGCAAAAA CCGGGGTTGT GTCTTAACTG 

401 CAATACATTT AAATGTCACT GACTTGGG7C TGGGCTATGA AACCAAGGAG 

4 51 GAACTGATTT TTAGGTACTG CAGCGGCTCT TGCGATGCAG CTGAGACAAC 

i)01 GTACGACAAA ATATTGAAAA ACTTATCCAG AAATAGAAGG CTGGTGAGTG 

551 ACAAAGTAGG GCAGGCATGT TGCAGACCCA TCGCCTTTGA TGATGACCTG 

601 TCGTTTTTAG ATGATAACCT GGTTTACCAT ATTCTAAGAA AGCATTCCGC 

6il TAAAAOGTGT GGATGTATCT GACTGGtGCG CCCTCTTTCC CGACGTTAAA 

7 01 GGGATGAAAC CACAAGACXT ACCTTCGCTC GGAAGTAAAA CGACAAACAC 
751 ACACAGTTTT GCCCGTTTTC ATGAGAAATG GGACGTCTGC GCACGAAACG 
801 CGCCGTCGCT TGAGGAGGAC TTGTACAAAC ACGATCTATG CAGGTTTCCC 

8 51 CAACTGACAC AAACCGTGCA ACTTGAAACT CCGCCTGGTC TTTCCAGGTC 
90\ TAGAGGGGTA ACATTTTGTA CTGTGTTTGA CTCCACGCTC GATCCACTAG 
951 CGAGTGTTAG TAGCGGTACT GCtGTCTCGT AGCOGAGCAT GTTGGCCGTG 

1 DDI GGAACACCTC CTTGGTAACA aggacccacg GGGCCGAAAG CCATGTCCTA 
1051 ACGGACCCAA catgtgtgca accccagcac ggcagcttta ctgtgaaacc 
1101 cacttcaagc tgacattgat actggtactc aaacactggt gacaggctaa 

U^l GGATGCCCTT CAGGTACCCC GAGGTAACAA GCGACACTCG GGATCTGAGA 

1201 AGGGGACTGG GACTTCTTTA AAGTGCCCAG TTTAAAAAGC TTCTACGCCT 

I2bl GAATAGGTGA CCGGAGGCCC gcacctttcc ttttataacc actgaacaca 

1301 tggaagacgc caaaaacata aagaaaggcc cggcgccatt ctatcctcta 

1351 gaggatggaa ccgctggaga gcaactgcat aaggctatga agagatacgc 

14 01 cctggttcct ggaacaattg cttttacaga tgcacatatc gaggtgaaca 

M51 TCACGTACGC GGAATACTTC GAAATGTCCG TTCGGTTGGC AGAAGCTATG 

IbOl AAACGATATG GGCTGAATAC AAATCACAGA ATCGTCGTAT GCAGTGAAAA 

1551 CTCTCTTCAA TTCTTTATGC CGGTGTTGGG CGCGTTATTT ATCGGAGTTG 

1601 CAGTTGCGCC CGCGAACGAC ATTTATAATG AACGTGAATT GCTCAACAGT 

1651 ATGAACATTT CGCAGCCTAC CGTAGTGTTT GTTTCCAAAA AGGGGTTGCA 

nOl AAAAATTTTG AACGTGCAAA AAAAATTACC AATAATCCAG AAAATTATTA 

1751 TCATGGATl'C TAAAACGGAT TACCAGGGAT TTCAGTCGAT GTACACGTTC 

leOl GTCACATCTC ATCTACCTCC CGGTTTtAAT GAATACGATT TTGTACCAGA 

1?51 GTCCTTTGAT CGTGACAAAA CAATTGCACT GATAATGAAT TCCTCTGGAT 

1301 CTACTGGGTT ACCTAAGGGT GTGGCCCTTC CGCATAGAAC TGCCTGCGTC 

1951 AGATTCTCGC ATGCCAGAGA TCCTATTTTT GGCAATCAAA TCATTCCGGA 

2001 TACTGCGATT TTAAGTGTTG TTCCATTCCA TCACGGTTTT GGAATGTTTA 

2 051 CTACACTCGG ATATTTGAtA TGTGGATTTC GAGTCGTCTT AATGTATAGA 
2101 tTTGAAGAAG AGCTGTTTTT ACGATCCCTT CAGGATTACA AAATTCAAAG 
2151 TGCGTTGCTA GTACCAACCC TATTTTCATT CTtCGCCAAA AGCACTCTGA 
2201 TTGACAAATA CGATTTATCT AATTTACACG AAATTGCTTC TGGGGGCGCA 
2251 CCTCTTTCGA AAGAAGTCGG GGAAGCGGTT GCAAAACGCT TCCATCTTCC 
?.30T AGGGATACGA CAAGGATATG GGCTCACTGA GACTACATCA GCTATTCTGA 
2*^51 TTACACCCGA GGGGGATGAT AAACCGGGCG CGGTCGGTAA AGTTGTTCCA 
2401 TTTTTTGAAG CGAAGGTTGt GGATCTG3AT ACCGGGAAAA CGCTGGGCGT 
2 4^1 TAATCAGAGA GGCGAATTAT GTGTCAGAGG ACCTATGATT ATGTCCGGTT 
2S01 ATGTAAACAA TCCGGAAGCG ACCAACGCCT TGATTGACAA GGATGGATGG 
2551 CTACATTCTG GAGACATAGC TTACTGGGAC GAAGACGAAC ACTTCTTCAT 
2601 AGTTGACCGC TTGAAGTCTT TAATTAAATA CAAAGGATAT CAGGTGGCCC 
2 6bl CCGCTCAATT GGAATCGATA TtGTTACAAC ACCCCAACAT CTTCGACGCG 
2701 GGCGTGGGAO GTCTTCCCGA CGATGACGCC GGTGAACTTC CCGCCGCCGT 
2751 TGTTGTTTTG GAGCACGGAA AGACGATGAC GGAAAAAGAG ATCGTGGATT 
2aOT ACGTCGCCAG TCAAGTAACA ACCGCGAAAA AGTTGCGCGG AGGAGTTGTG 
2851 TTTGTGGACG AAGTACCGAA AGGTC1TACC GGAAAACTCG ACGCAAGAAA 
2901 AATCAGAGAG ATCCTCAtAA AGGCCAAGAA GGGCGGAAAG TCCAAATTGT 
2951 AAAATGTAAC TGTATTCAGC GATGACGAAA TTCTTAGCTA TTGTAATGAC 
3001 TCTAGAGGAT CTTTGTGAAG GAACCTTACT TCTGTGGTGT gacataattg 
3051 GACAAACTAC CTACAGAGAT T7AAAGCTCT AAGGTAAATA TAAAATTTTT 
3101 AAGTGTATAA TGTGTTAAAC TACTGATTCt AATTGTTTGT GTATTTTAGA 
3151 TTCCAACCTA TGGAACTGAT GAATGGGAGC AGTGGTGGAA TGCCTTTAAT 
3201 GAGGAA/^^CC TGTTTTGCTC AGAAGAAATG CCATCTAGTG ATGATGAGGC 
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TACTGCTGAC 
AAGACCCCAA 
GTGTTTAGTA 
AAAAGCTGCA 
TTATAAGTAC 
CCACACAGGC 
TACCTTTAGC 
ATAGTGCCTT 
TTTACTTGCT 
AAATGAATGC 
TACAAATAAA 
ACTGCATTCT 
TCTGGATCCC 
GGTCATAGCT 
AACATACGAG 
GAGCTAACTC 
GAAACCTGTC 
GGCGGTTTGC 

gcgctcggtc 
taatacggtt 

GCAAAAGGCC 
GTTTTTCCAT 
CAAGTCAGAG 
CCCCCTGGAA 
CGGATACCTC 
GCTCACGCTG 
GGCTGTGTGC 
TAACTATCGT 
CAGCAGCCAC 
ACAGAGTTCT 
ATTTGGTATC 
GTAGCrCTTG 
GTTTGCAAGC 
TTTGATCTTT 
AAGGGATTTT 

TtAaattaaa 
ttggtctgac 
tctgtctatt 

ACTACGATAC 
GCGAGACCCA 

ccggaagggc 

CAGTCTATTA 
TAGTTTGCGC 
CGTCGTTTGG 
GTTACATGAT 
TCCGATCGTT 
TGGCAGCACT 
TCtGTGACTG 
GCGACCGAGT 
ATAGCAGAAC 
AAACTCTCAA 
TCGTGCACCC 
GGTGAGCAAA 
ACACGGAAAT 
CATTTATCAG 
AGAAAAATAA 
CCTGACGtCT 
GCGTATCACG 
ACCTCTGACA 
GATGCCGGGA 
GTGTCGGGGC 
TGCACCATAT 
CGCATCAGGC 
ATCGGTGCGG 
CTGCAAGGCG 
TCTAAAACGA 
GTGACAGCTG 
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TCTCAACATT 
CGACTTTCCT 
ATAGAACTCT 
CTGCTATACA 
GCATAACAGT 
ATAGAGTCTC 
TTTTTAATTT 
GACTAGAGAT 
TTAAAAAACC 
AATTGtTGTT 
GCAATAGCAT 
AGTTGTGGTT 
CGGGTCCCTA 
GTTTCCTGTG 
CCGGAAGCAT 
ACATTAATTG 
GTGCCAGCTG 
GTATTGGGCG 
GTTCGGCTGC 
ATCCACAGAA 
AGCAAAAGGC 
AGGCTCCGCC 
GTGGCGAAAC 
GCTCCCTCGT 

tccgcctttc 

TAGGTATCTC 
ACCAACCCCC 
CTTGAGTCCA 
TGGTAACAGG 
TGAAGTGGTG 
TGCGCTCTGC 
ATCCGGCAAA 
AGCAGAtTTAC 
TCTACGGGGT 
GGTCATGAGA 
AATGAAGTTT 
AGTTACCAAr 
TCGTTCATCC 
CGCAGGGCTT 
CGCTCACCGG 
CGAGCGCAGA 
ATTGTTGCCG 
AACGTTGTTG 
TATGGCTTCA 
CCCCCATGTT 
GTCAGAAGTA 
GCATAATTCT 
GTGAGTACTC 
TCCTCTT6CC 
I'TTAAAAGTG 
GGATCrXACC 
AACTGATCTT 
AACAGGAAGG 
GTTGAATACT 
GGTTATTGTC 
ACAAATAGGG 
AAGAAACCAT 
AGGCCCTTTC 
CATGCAGCTC 
GCAGACAAGC 
TGGCTTAACT 
CCGGTGTGAA 
CCCATTCGCC 
GCCTCTTCGC 
ATTAAGTTGG 
CGGCCAGTGA 
GCGAAGAACC 



CTACTCCTCC 
TCAGAATTGC 
TGCTT6CTTT 
AGAAAATTAT 
TAIAATCATA 
TGCTATTAAT 
GTAAAGGGGT 
CATAATCAGC 
TCCCACACCT 
GrTAACTTGT 
CACAAATTTC 
TGTCCAAACT 
TAGTGAGTCG 
TGAAATTGTT 
AAAGTGTAAA 
CGTTGCGCTC 
CATTAATGAA 
CTCTTCCGCT 
GGCGAGCGGT 
TCAGGGGATA 
CAGGAACCGT 
CCCCTGACGA 
CC6ACAGGAC 
GCGCTCTCCT 
TCCCTTCGGG 
AGTTCGGTGT 
CGTTCAGCCC 
ACCCGGTAAG 
ATTAGCAGAG 
GCCTAACTAC 
TGAAGCCAGT 

caaaccaccg 
gcgcagaaaa 
ctgacgctca 

TTATCAAAAA 
TAAATCAArC 
GCTTAATGAG 
ATAGTTGCCT 
ACCATCTGGC 
CTCCAGATTT 
AGTGGTCCTG 
GGAAGCTAGA 
CCATTGCTAC 
TTCAGCTCCG 
GTGCAAAAAA 
AGTTGGCCGC 
CTTACTGTCA 
AACCAAGTCA 
CGGCGTCAAT 
CTCATCATTG 
GCTGTTGAGA 
CAGCATCTTT 
CAAAATGCCG 
CATACTCTTC 
TCATGAGCGG 
GTTCCGCGCA 
TATTATCATG 
GTCTCGCGCG 
CCGGAGACGG 
CCGTCAGGGC 
ATGCGGCATC 
ATACCGCACA 
ATTCAGGCTG 
TATTACGCCA 
GTAACGCCAG 
ATrTCGACCT 
ATGGCCAGCT 



AAAAAAGAAG 
TAAGTTtTTT 
GCTATTTACA 
GGAAAAATAT 
ACATACTGTT 
AACTATGCTC 
TAATAAGGAA 
CATACCACAT 
CCCCCTGAAC 
TTATTGCAGC 
ACAAATAAAG 
CATCAATGTA 
TAPTAGCTTG 
ATCCGCTCAC 
GCCTGGGGTG 
ACTGCCCGCr 
TCGGCCAACG 
TCCTCGCTCA 
ATCAGCTCAC 
ACGCAGGAAA 
AAAAAGGCCG 
GCATCACAAA 
TATAAAGATA 
GTTCCGACCC 
AAGCGTGGCG 
AGGTCGTTCG 
GACCGCTGCG 
ACACGACTTA 
CGAGGTATGT 
GGCTACACTA 
TACCTTCGGA 
CTGGrAGCGG 
AAAGGATCIC 
GTGGAACGAA 
GGATCTTCAC 
TAAAGTATAT 
TGAGGCACCT 
GACTCCCCGT 
CCCAGTGCTG 
ATCAGCAATA 
CAACTTTATC 
GTAAGTAGTT 
AGGCATCGTG 
GTTCCCAACG 
GCGGTTAGCT 
AGTGTTATCA 
TGCCATCCGT 
TTCTGAGAAT 
ACGGGATAAT 
GAAAACGTTC 
TCCAGTTCGA 
TACTTTCACC 
CAAAAAAGGG 
CTTTTTCAAT 
ATACATATTT 
CATTTCCCCG 
ACATTAACCT 
TTTCGGTGAT 
TCACAGCTTG 
GCGTCAGCGG 
AGAGCAGATI 
GATGCGTAAG 
CGCAACTGTT 
GCTGGCGAAA 
GGTTtTCCCA 
GCAGTCGACA 
GGTGACAAGC 



AGAAAGGTAG 
GAGTCATGCT 
CCACAAAGGA 
TCTGTAACCT 
TTTTCTTACT 
AAAAATTGTG 
TATTTGATGr 
TTGTAGAGGr 
CTGAAACATA 
TTATAATGGT 
CATTTTTTTC 
TCTTATCATG 
GCGTAATCAT 
AATTCCACAC 
CCTAATGAGT 
TTCCAGTCGG 
CGCGGGGAGA 
CTGACTCGCT 
TCAAAGGCGG 
GAACATGTGA 
CGTTGCTGGC 
AATCGACGC? 
CCAGGCGTTT 
TGCCGCTTAC 
CTTTCTCAAT 
CTCCAAGCTG 
CCTTATCCGG 
TCGCCACTGG 
AGGCGGTGCT 
GAAGGACAGT 
AAAAGAGTTG 
TGGTTXrTTT 
AAGAAGATCC 
AACTCACGTT 
CTAGATCCTT 
ATGAGTAAAC 
AtCTCAGCGA 
CGTGTAGATA 
CAATGATACC 
AACCAGCCAG 
CGCCTCCATC 
CGCCAGTTAA 
GTGTCACGCT 
ATCAAGGCGA 
CCTTCGGTCC 
CTCATGGTTA 
AAGATGCTTT 
AGTGTATGCG 
ACCGCGCCAC 
TXCGGGGCGA 
TGTAACCCAC 
AGCGTTTCTG 
AATAAGGGCG 
ATTATTGAAG 
GAATGTATTT 
AAAAGTGCCA 
ATAAAAATAG 
GACGGTGAAA 
TCTGTAAGCG 
GTGTTGGCGG 
GTACTGAGAG 
GAGAAAATAC 
GGGAAGGGCG 
GGGGGATGTG 
GTCACGACGT 
GAAGCCTTAC 
CAAAACAGCT 
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6601 CTGGCTCGCA AAACATGTTC CCTTGGCTGC TTTCCACTTC CCCTTGTGCT 

6651 TTGTTTACTT GTGTCAGCTG GTTGGCTCCC TAGGTATGAG CTCATGCTTG 

6701 GCTGGCAGCC ATCCAGTTTT AGCCAGCICT GCTTTGTTTA CTTGTGTCAG 

6-7^1 CTGGTTGGCT CCCTAGGTAT GAGCTCATGC TTGGCTGGCA GCCATCCAGT 

6801 TTTAGCCAGC TCCTCCCTAC CTTCCCXTTT TTTTATATAT ACAGGAGGCC 

bH51 GAGGCCGCCT CCGCCTCCAA GCTTACTCAG AAGtAGTAAG GGCGTGGAGG 

6901 CTTTTTAGGA GGCCAGGGAA ATTCCCTTGT TrTTCCCTTT TTTCCAGTAA 

69!j3 TTTTTTGCTG CAAAAAGCTA A 
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pD12JCV»Bhort-3iCmr 

Lengchi 7558 

1 GCTAGCGATT TAGGTGftCAC TATAGMTCt cgacnnGTCJl CCXXTTAGAGT 

51 CGAGCTOTGA COGTCCTTAC AATQMATGC ANCTOOGTCft TCTTCTTCCT 

101 OATGGCAGGG OTTACAGGTA AOOOOCTCCC AACJTCCCAAA CTTCAGGOrC 

151 CATAAACTCT GrTGACAGTGG CWVTCACTTT GCCTTTCTTT CTACAOGOGT 

201 GAATTCGGCT TTCACAGAGC AITGACCGCT QACXTCXTTCAC CGTCGGGACC 

251 TCTGTAGCCG CrCTA3rTGG CTAGCAAGQA AOMTTCxaTTC AGftCCTTGAC 

3 01 TGCTCTTACG GAATCCTATG TAAGTTGCCT ATTTTQCrGT TATCTGTTTT 

351 CCCTTCATCT TTTrTGATCC AGCAACTTAC CATCAOSCAT CAGCTCCAIT 

401 ACCAAXTGTQ AAASCTCXAA TCATATAiSTC ATTCATATAG GTTATTrQAC 

451 ATGPGGCCCTT CCCTTGAQGA AftCCCATaXG ACTTTATTTT CTTCCTCTOG 

501 GCTGTTTAGG AGATGAAGTT ACTTCSAATGA GAAAATATAT ATGGAGTTCT 

551 ACJAAAGGATT GOTTTATATG TCTTGGAOGC TATTTCAAAA TTTATTTGGC 

601 CATAXATTCT GAATACTACC TASAAGAGAT TAGCCAT6G6 CCCTRTXSGGT 

651 TNTTCATAAG CCATTGTTCT aAAHTTTTTT AGCTTTGTAA ATGAAAOGTr 

701 TATOOGATAO GAAGAGTIICT ATGAA0GTG6 GAGGAATTTO TAAATCCTAC 

751 CAATTTimarC TATATAOCAT TAOCCCCCAC CTTTTANTAT TCtOCATCAA 

801 AAOtAAQAlT OtCTCTAAAO AQAAAOOTNA GCTATCAAAA GGACTCCTAT 

851 AAHATTQUrr GGAAACmTP GGAAKfTCTCA AATTTNTTTC AGCTAATTKT 

901 ItSGAGTTCCA AANTTTGTCT nmtACASnr AAOGK30GMIC CCGATTCANA 

9S1 TTTiTCCCMCC TKMNOAHAAT GCTTGQCJOaA AAAAACCTNC CAACCCCNTT 

lOQi OTOSGAmAA arrrrrTTAA nhttttaaog ctognngaaa CNGGfirrmA 

X05X ATTrrrrOOO NCNAHOiCCI NTCCCCGOTA CCAOOAAAAT CAGGA CC I Mf 

1101 Tmoaaam (aiaoicozAC lioooooaiAA aanoggaaat ttcmtcamaa 

1151 AAAATCTTTT CCGkmnnnng tgaagcatca gggcctgaac aaga&catca 
1201 acccggactc tgcggatggg atgccagtgg caagcactga tcagtggagt 
1251 gagctg ccg aggcagagcg actccaagag oaccttcaag ctcatcgtac 
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1301 ctcccatgtt ttgttggcca ggctcttaga aga cagcag gtgcatttta 

1351 ccccaa cga aggtgacttc catcaagcta taeataccet tcttctecaa 

1401 gtcgctgcct ttgcat cca gatagaggag ttaatgab c tccbggaata 

14S1 caag tcccc cgcaatgagg etgatgggat gcctattaat gttgg gatg 

1501 gtggtctcct tgaga^aag ctgtggggcc taaaggtgct geaggagctt 

1S5X toacagtgga cagtaaggtc catccatgac cttcgtttoa tttettctea 

1601 ccagactggg ateccagcao gtgggagcca ttatattgct aacaacaaga 

1651 aaatgtagnn nnngcggccT GGOCXXitcrT TOCCQACGTT AAAOQGATGA 

1701 AACCACAAGA CTTACCTTCG CTCQQAAOTA AAA08ACAAA CACACACAOT 

1751 TTTGCCCGTT TtCATGAOAA ATGQGACWTC TGOGCACGAX ACGCOCOSTC 

1501 GCTTGMSGAO GACTTGTACA AACACOAtTCT ATGCAQOtTT CCCCAACTOA 

1851 CACAAACCGT GCAACTTGAA ACTCCQCCTO GTCTTTCCAO GTCTAGAOaO 

1901 GTAACATTTT OIACTGTGTX TGACTCCACG CTOGATCCAC TAGCGAOTCZT 

19S1 TAGTAGCOOT ACTGCTGXCT CGTW3C3GAG CAXGTTG6CC GTGOQAACAC 

2001 CTOCTTGOTA ACAAGGACCC AOOGGGCCGaW AAOCCATGTC CTAAOGGACC 

2051 CMCfOXSetGrS OCSVACCCCAQ Ca^CGGCAOCT TTACTGTOAA ACCCACTTCA 

2101 AGOTGacATT GATACTGOTA CTCAAACACT OaiGACAOGC TAAGGATOCC 

21 SI CTTCaflGTAC CCCGAGGTAA eSUtfSCOACAC TCGGGATCTG AGAAOGGQAC 

2201 TOOGACTTCT TTAAAOTOCC CAGTTTAAAA AGCTTCTACG CCTGAATAOG 

2251 TOACCGGAGO CCGOCACCTT TCCTTTTATA ACCACTGAAC ACATGQAAGA 

2301 COCCAAAAAC ATAAAGAAAG GCCCGOOGCC ATTCTATCCT CTAGAGGATG 

23 SI GAAOCGCTOG AGAGCAAC1K5 CATAAOGCTA TGAAGAGATA COCCCTGGTT 

2401 CdGGAACAA TTGCTTTtAC AGATOCACAT ATC0M3GTGA ACATCACOTA 

2451 CaCXSGAATAC TTCXSAAATGT COOTTOGWTT GGCAGAAGCT ATGAAACQAT 

2501 ATGOGCTGAA TACAAATCAC AGAATCOTCG lATOCAOTGA AAACTCTCTT 

2551 CAATTCTTTA TGCCGGTOTT GGGGGOGTTA TTTATOGGAO TTGCAOTIOC 

2601 GCCCGCGAAC GACATTTATA ATGAACXJTOA ATTOCTCAAC ACTATOAACA 

2651 TTTCGCAGCC TACOOTAffTG TTTGTWCCA AAAAGGGGTT GCAAAAAATT 
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2701 TTQAACGTOC AAAAAMATT ACCAATAATC CAOAAAATTA TTATCATGGA 
275X TTCXAAMVCG GATTACCM36 QATXTCAiOTC OATC^ACAOG TTOGTCACAT 
2601 CTCATCTACC TCCCG GTX ' A T AATQAATAOG ATtTTGrACC AQAGTCCTrr 
2851 GATCQTOACA AAAC3VATTGC ACTOATAATG AATTCCTCTG GATCTACTGG 
2901 6TTACCTAAO GaTGTGGCCC tTCOGCATAG AACTOCCTGC GTCAOATTCT 
2951 CGCATGSCCAG AGATCCTATT TTTGGCAATC AAATCATTCC GGATACT6CS 
3001 ATTTTAACrXG TTOTTCCATT CCATCAOOGT TTTGGAATQT MACTACACT 
3051 CGGATATTXXS ATATQTGGAT TTCQAOT06T CTTAATGIAT AOATTTGAAO 
3101 AAOAGCTGrT TTTAOOATCX: CTTCJUSOAIT ACAAAATTCA AAOTGOGnrTG 
3151 CTAOIACCAA CCCTATTTTC ATTCXTCGtCC AAAAOCACTC TGATT6ACAA 
3201 ATACGATTTA TCTAATTTAC ACGAAAWGC TTCTOGaaGC GCACCTCTTr 
3251 OCSAAAOAAGT CGGOQAAGCG GTTCX»AAAC OCRCCATCT TCCAGOOATA 
3301 CQACAACSQAT AXQGGCTCAC TOACSACTACA TCAGCTATTC TCUkTTACACC 
3351 OGAaaaaOAT OATAAACOOG OCaoaQTCOO rAAAOTTOTT CCATTTTTTG 
3401 AAGC0AAO9T TntSOATCTG GATACCX30QA AAACGCTGQG CGTTAATCAO 
3451 AOAOGOGAAT TATGTOTCAfS AGGACCTATG ATTATOTCCG GTTATOTAAA 
3501 CAATCOOOAA GCGACCAAOS CdTaATTCA CAAOQAXGGA TGGCTACATT 
3551 erCQAGACAT AGCTTACIGG OACOAAOAOO AACACTTCTT CATAOnOAC 
3601 CGCTTQAAGT CTTTAATTAA ATACAAAGGA TAtCACGTOG CCCCCGCTCA 
3651 ATraOAATOO ATATTGTTAC AACACCCCAA CATCTTCGAC aCQCSGGSTOG 
3701 CAGGTCTTCC CGACQATGAC OCCGGTOAAC TTCCOGCOQC 0(»l!GTTaTT 
3751 TTGGA0CAC6 GAAAOAOCSAT GAOQGAAAAA eAfiATOOTQOl ATTAOGTCGC 
3801 CASTCAM3TA ACAACOGOGA AAAAGTTOOO CGGAOGAOTT GTGTTTOTGG 
3851 ACGAACnCACC GAAAGGTCTT ACCOGAAAAC TCGAOGCAAG AAAAATCAOA 
3901 QAGATOCTCA TAAAOGCCAA OAAQGGC6GA AAOTCCAAAT TGTAAAATGT 
39S1 AACtCTATTC ASCGATQACG AAATTCTXAfi CTATIGTAAT GACTCIASAO 
4001 GATCnTCTG AAOGAAOCIT ACTTCTGT(» TOTQACATAA TTQGSACAAAC 
4051 TACCTACAOA GATTTAAAGC TCTAAOGTAA ATATAAAATT rtTAAGTGXA 
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4101 TAATOTGTTA AACTACTGAT TCTAATTGrrT TOTOTATTTT AGATTCCAAC 
4151 CTATOOAACI GATCAATOOQ AOCAaTGGTG GRATOCCTTT AATGAQGAAA 
4201 ACCTCSTTTTQ CTCAGAAOAA ATOCCATCTA GTGATGATQA OOCTACTGCT 
4251 GACTCTCAAC ATTCTACTCC TCC3UUU«AG AAGM3AAAQ0 TAGAAGACCC 
4301 CAAGGACTTT CCTTCAGAAT TOCTAAQTTr TTTOAGTCAT OClVlt/riTA 
435 1 GTAATAGIUC TCTTOCTTGC TTTQCTATTT ACACCACAAA GGAAAAAGCT 
4401 GCACTOCTAT ACAAGAAAAT TATGGAAAAA TATTCTOTAA CCTTTAIAAQ 
4451 TAiSGCATAAC AfilTATAATC ATAACATACT GTTXnTCTT ACTCCACACA 
4501 QGCAXM3AGT GTCTGCTATT AATAACTAIG CTCAARAATT GTSTACCTrt 
4551 AOCrrrrPAA TTTOTAAAGG GGTTAATAAO qAAtAmCGA TGTATAOTOC 
4601 CTTOACtAflA GATCATAATC AGCCATACCA CATTTOTAGA GOTTTTACTT 
4S51 OCTTTAAAAA ACCTCCCACA CCTCCCCCTO AACCTGfcAAC ATAAAATQAA 
4701 TGCAATTGTT onTGTTAACT TOTXTATIGC AGCTTATAAT OOTTACAAAT 
4751 AAAGCAATAG CATCACAAAT TTCACAAATA AAOCATTTTT tTCACTGCAT 
4801 TCTAfltTGTG GTTTCrrCCAA ACTCATCAAT <1TATC«ATC ATGTCTGGAT 
4851 CCXttOOOTOC CTATAQTCAG TCOTATTAlGC TTOGCOTAAT CATQGTCATA 
4901 OCTGTTTCCT OTOTGAAATT GTTATCCaCT CACAATTCCA GACAACATAC 
4951 OAGCCGGAAG CATAAAGTGT AAASCCTQQO GIOCCTAATO AOTOA0CTAA 
5001 CTCACATTAA TTGCGTTGCG CTCACTQCCC GCTTTCCAGT COOGAAACCT 
5051 OTCOIGCCAO CTOCATTAAT OAATCGGCCA AGGOGCOGOG A«AGGCGOTT 
5101 TGOOTATTQG GCGCTCTTCC GCTTCCTCGC TCACWkACTC GCTGGGCTCO 
5151 GTCGTTCOQC TGCGGCOAOC GCTATCAOCT CACTCAAAQG CGGTAATAOG 
5301 aTTATCCACA GAATCAGOOG AtAACOCASG AAAOAAGAXG TGAOCAAAAG 
5251 OCCAGCAAAA GGCC3M3GAAC COTAAAAAGO COGCGTTGCT OOCOTTTTTC 
5301 CATAGGCTCC GCCCCCCTGA COAGCATCAC AAAAATCGAC GCTCAMWCA 
5351 GAI30TGGCGA AACCCGACAO OACTATAAAO ATAGCAGGCX3 TTTCCCCCTG 
5401 GAAOCTCCCT CGTGCOCTCT CCTGTTCCaA CCCTGCCGCT TACCGGATAC 
5451 CTGTCC6CCT TTCTCCCTTC GQGAAGOOTO GCGCTTTCTC AATGCTCACX3 
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S501 CT8TA061AT CTGASTTOOG TGTAEtSTOaT T03CICCAAG CIGQGCTOZtS 
SS51 TGCACX3AACC CCCCGTTCJIG CCGGM:oQCT OCI3CCTTATC CQGTAACTAT 

S601 CXTrcrrOAOT ocaacgcogi aaqacacgac ttatcgccac togcaggagc 

5€S1 CACTGQTAAC AOOATKAOCA GAOCOAOGTA. TSTAQGOGGT GCTACA6A6T 
5701 TCTTSAAGTG GTOGCCTAAC TACQ6CTACA. CTAGAAGCSAC AfiTATTIGGT 
S7S1 ATCTQCGCTC TGCTdAAOCC ASITACCTTC GGAAAAAOAG rrGGTAOCTC 
5801 TTGMTCCGGC AMkCAAACCA COGCTGQTAG OQOTGCTTTT TTTCnTTGCA 
5BS1 AOC A QCAOftT TACGCQCAGIV AAAAAAGOAT CTCAMSAAOA TCCTTTOATC 
5901 TTTTCTAOOG GGTCTGAOQC TCAOTOGAAC OAAAACTCAC GTXAAGGOAT 
S951 TnoGTCATG AQATTATCAA AAAOQATCTT CACCTAOATC CTTTTAAATT 
6001 AJVAAATOAAG TTTTAAATCA ATCtAXAOTA TATATGAGTA AACTTGGTCT 
6051 GACAGTTACC AAtGCTTAAT CAOTGAGGCA CCTATCTOW CGATCTOTCT 
6101 ATTTOarrCA TCCATftfiTIG CCTGACTCOC OSTOOIGTAG ATAACTACGA 
6151 TACOGOAGGQ CTTACCATCT OGCCCGAGTG CTQCAA1GAT ACOGGOMSAC 
6301 CCACGCTCAC CGOCTCCAOA TTTATCASCA ATAAACCAOC CAOCCQQAAG 
6251 GQC0GA80QC AQUUGTOQTC CTOCAACTTT ATCOGCCTCC AXCCAGTCTA 
6301 TCAATTQXTC CCOGQIkABCT AOAOTAAOTA OTTOGCCAGT TAAIAGTTTG 
6351 C0CAA0QTT6 TTGCCATYGC TACAOOCATC GiaJTOTCAC q CTOGT Ca TI 
6401 TQCTZATGQCT TCATTCAOCT CC0I3TTCCCA AaSKTCAAQG OOAGTTACAT 
6451 GATCCCCCAT GTTGTGCAAA AAAaOGGTTA OCTOCTTCGC TCCTOSQATC 
6501 OTTGTCAOAA QrCXUSTtOGC CGCAQTOITA TCACTCATOQ TTAtGGCASC 
6551 ACrSGATAAT TCTCTTACTC TCA1GCX3VPC CCTAAQAT6C mTCIGTOA 
6601 CTQGrFGAfilA CTCAACCM6 TCAITCraAG AAXAGTGTAT QG6GCQIICCG 
6651 AGRGCrCXT QCCCXSOOOtC AJCTAGOGGAT AAZACCGCXSC CACAtAOCAG 
6701 AACTTTAAAA GTOCTCATCA TTQOAAAACO TTCTTOOGGG CGAAAACTCT 
6751 aVAOGATCTT ACCQCTGTTO AGATCC3U3IT oaATGZAACC CACTCX3XGCA 
6801 CCCAACIGAT CTTOUSCATC TTTTACTTTC ACCAGCGTTT CTOOGtOAGC 
6851 AAAAACASOA AOQCAAAAIO COG C AAAAAA GGGftATAAOG acca^ChCGGA 
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6901 AATGTTGtfAT ACTCKTACTC TTCCTTTTTC AMATrATTG AAGC^TTTAT 

69S1 CAXKaGTrATT CTCTCATGA8 OGGAIACATA TTTGAATOTA TTTAGAAAAA 

7001 TAAACAAATA GGGGTTCCOC GCACATITCC CCOAAAAGTG CCACCTGACG 

7051 TCTAAaAAAC CAITATTATC ATOACATTAA CCTATAAAAA TAGGCOTATC 

7101 ACGAQOCCCT TTCGTCTCX3C GCOTtTCQOT OATGACGGTQ AAAACCTCTG 

71S1 ACACATOCMS CTCCCGGA3A C5GGTCACW3C TIOTCTGTAA OCGGATGCCO 

7201 QOAGCAGACA A8CC0GVCA6 OOCGCGTCAG 0QGGTG7TG6 CGGOTOTCGG 

72 SI GGCnWSCTTA ACTATOOQGC ATCAaAGCAG ATTGTACTOA CaGTGCACCA 

7301 TAT00QGT6T GUUkATACCXSC ACAQATGCQT AASGAGftAAA TACCGCATCA 

73S1 GOCOCCATTC OCCATTCAflG CTOOCICAACT OTTQGGAAGG GCGATOQGTG 

7401 CGeOCCTCTT CGCTATTACG CCaOCTOaOG AAnQOGGGAT GIOCTOCAAG 

7451 G0C3»TTAACST TGGGTAACOC CaWJOtfiTTTC OCAOTCACGA C6TTGTAAAA 

7501 C(3ACG6CCACS TGAATTtCGA CCraCAOtcg acCttCttta tataCAcagg 

7551 aggjccgag 
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JCVPshort-hgdnf Length: 656b June S, 1999 16:57 Type: N Check: 



1 
51 

101 
151 
201 
21)1 
301 
351 
401 
451 
50] 
551 
501 
653 
7C1 
7bl 
901 
851 
901 
951 
1001 
1051 
1 101 
1153 
1201 
1251 
1301 
1351 
1401 
lAbl 
1501 
1551 
3 601 
1651 
1701 
1751 
1801 
1651 
1901 
1951 
2001 
?051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2652 
2701 
2751 
2801 
2851 
2901 
2951 
30C1 
3051 
3101 
3151 



GCTAGCCAIT 
TCTCGTGGCT 
TGCCCGCCGG 

ggccgccgcc 
ggattatcct 
ttaaaagact 
agagagcgga 
aaaagg7cgg 
caatacattt 
gaactgatt7 
gtacgacaaa 
acaaagtagg 
tcgtttttag 
taaaagctgt 
gggatgaaac 
acacagtttt 
cgccgtcgct 
caactgacac 
tagaggggpa 
cgagtgttag 
ggaacacctc 
acggacccaa 
cacttcaagg 

GGATGCCCTT 
AGGGGACTGG 
GAATAGGTGA 
TGGAAGACGC 
SAGGATGGAA 
CCTGGTTCCT 
tCACGTACGC 
AAACGATATG 
CTCTCTTCAA 
CAGTTGCGCC 
ATGAACATTT 
AAAAATTTTG 
TCATGGATTC 
GTCACATCTC 
GTCCTTTGAT 
CTACTGGGTT 
AGATTCTCGC 
TACTGCGATT 
CTACACTCGG 
TTTGAAGAAG 
TGCGTTGCTA 
TTGACAAATA 
CCTCTTTCGA 
AGGGATACCA 
TTACACCCGA 
TTTTTTGAAG 
TAATCAGAGA 
ATGTAAACAA 
CTACATtCTG 
AGTTGACCGC 
CCGCTGAATT 
GGCGTGGCAG 
TGTTGTTTTG 
ACGTCGCCAG 
TTTGTGGACG 
ftATCAGAGAG 
AAAATGTAAC 
TCTAGACGAT 
GACAAACTAC 
AAGTGTATAA 
TTCCAACCTA 



TAGGTGACAC 
GTCTGCCTGG 
TAAGAGGCCT 
GCGCGCCCTT 
GATCAGTTCG 
GAAAAGGTCA 
ATCGGCAGGC 
AGAGGCCAGA 
AAATCTCACT 
TTAGGTACTG 
ATATTGAAAA 
GCAGGGATGT 
ATGATAACCT 
GGATGTATCT 
CACAAGACTT 
GCCCGTTTTC 
TGAGGAGGAC 
AAACCGTGCA 
ACATTTTGTA 
TAGCGGTACT 
CTTGGTAACA 
CATGTGTGCA 
TGACATTGAT 
CAGGTACCCC 
GACTTCriTA 
CCGGAGGCCG 
CAAAAACATA 
CCGCTGGAGA 
GGAACAATTG 
GGAATACTTC 
GGCTGAATAC 
TTCTTTATGC 
CGCGAACGAC 
CGCAGCCTAC 
AACGTGCAAA 
TAAAACGGAT 
ATCTACCTCC 
CGTGACAAAA 
ACCTAAGGGT 
ATGCCAGAGA 
TTAAGTGTTG 
ATATTTGATA 
AGCTGTTTTT 
GTACCAACCC 
CGATTTATCt 
AAGAAGTCGG 
CAAGGATATG 
CGGGGATGAT 
CGAAGGTTGT 
GGCGAATTAT 
TCCGGAAGCG 
GAGACATAGC 
TTGAAGTCTT 
GGAATCGATA 
GTCTTCCCGA 
GAGCACGGAA 
TCAAGTAACA 
AAGTACCGAA 
ATCCTCATAA 
TGTATTCAGC 
CTTTGTGAAG 
CTACAGAGAT 
TGTGTTAAAC 
TGGAACTGAT 



TATAGAATAG 
TGCTGCTCCA 
CCCGAGGCGC 
CGCGCTGAGC 
ATGATGTCAT 
CCAGATAAAC 
TGCAGCTGCC 
GGGGCAAAAA 
GACTTGGGTC 
CAGCGGCTCT 
ACTTATCCAG 
TGCAGACCCA 
GGTTtACCAT 
GACTGGTGCG 
ACCTTCGCTC 
ATCAGAAATG 
TTGTACRAAC 
ACTTGAAACT 
CTGTGTTTGA 
GCTGTCTCGT 
AGGACCCACG 
ACCCCAGCAC 
ACTGG'TACTC 
GAGGTAACAA 
AAGTGCCCAG 
GCACCTTTCC 
AAGAAAGGCC 
GCAACTGCAT 
ClTTTACAGA 
GAAATGTCCG 
AAATCACAGA 
CGGTGTTG6G 
ATTTAtAATG 
CGTAGTGTTT 
AAAAATTACC 
TACCAGGGAT 
CGGTTTTAAT 
CAATTGCACT 
GTGGCCCTTC 
TCCTATTTTT 
TTCCATTCCA 
TGTGGATTTC 
ACGATCCCTr 
TATTTTCATT 
AATTTACACG 
GGAAGCGGTT 
GGCXCACTGA 
AAACCGGGCG 
GGATCTGGAT 
GtGTCAGAGG 
ACCAACGCCT 
TTACTGGGAC 
TAATTAAATA 
"TTCTTACAAC 
CGATGACGCC 
AGACGATGAC 
ACCGCGAAAA 
AGGTCTTACC 
AGGCCAAGAA 
GAT6ACGAAA 
GAACCTTACr 
TTAAAGCTCT 
TACTGATTCr 
GAATGGGAGC 



ATCCCCATGA 
CACCGCGTCC 
CCGCCGAAGA 
AGTGACTCAA 
GGATTTTATT 
AAATGGCAGT 
AACCCAGAGA 
CCGGGGTTGT 
TGGGCTATGA 
TGCGATGCAG 
AAATAGAAGG 
TCGCCTTTGA 
ATTCTAAGAA 
CCGTCTTTCC 
GGAAGTAAAA 
GGACGTCTGC 
ACGATCTATG 
CCGCCTGGTC 
CTCCACGCTC 
AGCGGAGCAT 
GGGCCGAAAG 
GGCAGCTTTA 
AAACACTGGT 
GCGACACTCG 
TTTAAAAAGC 
TTTTATAACC 
CGGCGCCATT 
AAGGCTATGA 
TGCACATATC 
TTCGGTTGGC 
ATCGTCGTAT 
CGCGTTATTT 
AACGTGAATT 
GTTTCCAAAA 
AATAATCCAG 
7TCAGTCGAT 
GAATACGATT 
GATAATGAAT 
CGCATAGAAC 
GGCAATCAAA 
rCACGGTTTT 
GAGTCGTCTT 
CAGGATTACA 
CTTCGCCAAA 
AAATTGCTTC 
GCAAAACGCT 
GACTACATCA 
CGGTCGGTAA 
ACCGGGAAAA 
ACCTATGATT 
XGATTGACAA 
GAAGACGAAC 
CAAAGGATAT 
ACCCCAACAT 
GGTGAACTTC 
GGAAAAAGAG 
AGTTGCGCGG 
GGAAAACTCG 
GGGCGGAAAG 
TTCTTAGCTA 
TCTGTGGTGT 
AAGGTAAATA 
AATTGTTTGT 
AGT6GTGGAA 



AGTTATGGGA 
GCCTTCCCGC 
COGCTCCCTC 
ATATGCCAGA 
CAAGCCACCA 
GCTTCCTAGA 
ATTCCAGAGG 
GTCTTAACTG 
AACCAAGGAG 
CTGAGACAAC 
CTGGTGAGTG 
TGATGACCTG 
AGCATTCCGC 
CGACGTTAAA 
CGACAAACAC 
GCACGAAACG 
CAGGTTTCCC 
TTTCCAGGTC 
GATCCACTAG 
GTTGGCCGTG 
CCATGTCCTA 
CTGTGAAACC 
GACAGGCTAA 
GGATCTGAGA 
TTCTACGCCT 
ACTGAACACA 
CTAXCCTCTA 
AGAGATACGC 
GAGGTGAACA 
ACAAGCTATG 
GCAGTGAAAA 
AXCGGAGTTG 
GCTCAACAGT 
AGGGGTTGCA 
AAAATTATTA 
GTACACGXTC 
TTGTACCAQA 
TCCTCTGGAT 
TGCCTGCCTC 
TCATTCCGGA 
GGAATGTTTA 
AATGTATAGA 
AAATTCAAAG 
AGCACTCTGA 
TGGGGGCGCA 
TCCATCTTCC 
GCTATTCTGA 
AGTTGTTCCA 
CGCTGGGCGT 
ATGTCCGGTT 
GGATGGATGG 
ACTTCTXCAT 
CAGGTGGCCC 
CTTCGACGCG 
CCGCCGCCGT 
ATCGTGGATT 
AGCJAGTTGTG 
ACGCAAGAAA 
TCCAAATTGT 
TTGTAATGAC 
GACATAATTG 
TAAAATTTTT 
GTATTTTAGA 
TGCCTTTAAT 
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i201 GAGGAAAACC TGTTTTGCTC AGAAGAAATG CCATCTAGIG ATGATGAGGC 

3251 TACTGCT«5AC ^rCTCAACATT CTACTCCTCC AAAAAAGAAG AGAAAGGTAG 

3301 AAGACCCCAA GGACTTTCCT TCAGAATTGC TAAGTTTTTT GAGTCATGCT 

3351 GTGTTTAGTA ATAGAACTCT TGCTTGCTTT GCTATTTACA CCACAAAGGA 

3401 AAAAGCTGCA CTGCTAIACA AGAAAATtAT GGAAAAATAT TCTGTAACCT 

3 4=)1 TTATAAGTAG GCATAACAGT TATAATCATA ACATACTGTT TTTTCTTACT 
3501 CCACACAGGC ATAGAGTGTC TGCTATTAAT AACTATGCTC AAAAATTGTG 
3*^51 TACCTTTACC TTTTTAATTT GTAAAGGGGT TAATAAGGAA TATTTGATGT 
H601 ATACTGCCTT GACTAGAGAT CATAATCAGC CATACCACAT TTGTAGAGGT 
3651 TTTACTTGCT TTAAAAAACC TCCCACACCT CCCCCTGAAC CTGAAACATA 
3701 AAATGAATGC AAtTGTTGTT GTTAACTTGT TTATTGCAGC TTATAATGGT 
■-J751 TACAAATftAA CCAATA&CAT CACAAATTTC ACAAATAAAG CATTTTTTTC 
3801 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 
3851 TCTGGATCCC CGGGTCCCTA TAGTGAGTCG TATTAGCTTG GCGTAATCAT 
'3901 GGTCATAGCT CTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC 
3951 AACATACGAG CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT 
4001 GAGCTAACTC ACATTAATTG CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG 

4 051 GAAACCTCTC GTGCCAGCIG CATTAATGAA TCGGCCAACG CGCGGGGAGA 
4101 GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 
4151 GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCACCTCAC TCAAAGGCGG 
4 201 TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA 
4?.bl GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AARAAGGCCG CGTTGCTGGC 
4 301 GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA AATCGACCCT 
4 351 CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT 
4 4 01 CCCOCTGGAA GCTCCCTCGT GCGCTCTCCt GTTCCGACCC TGCCGCTTAC 
4 451 CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT 
4b01 GCTCACGCtG TftGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG 
4 551 GGCTGTGTCC ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG 
4 601 TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG 
4 651 CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT 
4701 ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT 
47 51 ATTTGGTATC TGCGCTCTGC TGAAGCCAGt TACCTTCGGA AAAACAGTTG 
4801 GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT 
4Sbl GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC 
4 901 TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT 
4 951 AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT 

50 01 ttaaattaaa aatgaagttt taaatcaatc TAAAGTATAT ATGAGTAAAC 

5051 ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 

blol tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata 

Slbl ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC 

5201 GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGOAATA AACCAGCCAG 

'j251 CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC 

5301 cagtciatta attgttgccg ggaagctaga GTAAGTAGTT cgccagttaa 

«r^51 TAGTTTGCGC AACGTTGtTG CCATTGCTAC AGGCATCGTG GTGTCACGCT 

54 01 CGTCGTTTCG TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA 

b4 51 GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTACCT CCTTCGGTCC 

5501 TCCGATCGTT GTCAGAAGtA AGTTGGCCGC AGTGTTATCA CTCAtGGTTA 

5551 TGGCAGCACT GCAtAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 

5601 TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG 

5651 GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC 

*^)701 ATAGCAGAAC TTTAAAAGTC CTCATCATTG GAAAACGTTC TTCGGGGCGA 

5 751 AAACTCTCAA GGATCTTACC GCTGTTGAGA TCCAGTTCGA TGTAACCCAC 

5801 TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC AGCGTTTCTG 

585 1 GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 

S^^Ol ACACGGAAAT gttgaatact CATACTCTTC CTTTTTCAAT ATTATTGAAG 

»^951 CATTTATCAG GGTTATTGTC tCATGAGCGG ATACATAtTT GAATGTATTT 

^nOl AGAAAAATAA ACAAATAGGG OTTCCGCGCA CATTtCCCCG AAAAGTGCCA 

6051 cc tgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag 

felOl gcgtatcacg aggccctttc gtctcgcgcg tttcggtgat gacggtgaaa 

61 HI ACCTCTGACA CATGCAGCTC CCGGAGACGG TCACAGCTTG TCTGTAAGCG 

6201 GATGCCGGGA GCAGACAAGC CCGTCAGGGC gcgtcagcgg gtgttggcgg 

625] GTGrCCGGGC TGGCTTAACT ATGCGGCATC AGAGCAGATT GTACTGAGAG 

6301 tccaccatat GCGGTGTGAA ATACCGCACA GATGCGTAAG GAGAAAATAC 

6351 CGCATCAGGC GCCATTCGCC AITCAGGCTG CGCAACTGTT GGGAAGGGCG 

64 01 ATCGGTCCGG GCCTCTTCGC TATTACGCCA GCTGGCGAAA GGGGGATGTG 

64 51 CTGCAAGGCG ATTAAGTTGG GTAACGCCAG GGTTTTCCrA GTCACGACGT 

6501 TGTAAAACGA CGGCCAGTGA ATTTCGACCT GCAGtcgaCT: tttttratat 
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1 

51 
101 

201 
251 
301 
351 
401 
451 
£>01 
55>1 
601 
651 

■?oi 

751 

eol 

90} 

9[:1 
1001 
1051 
1101 
11^.1 
1201 
1251 

I3ni 

1351 
1401 
14S1 
1501 
1551 
1601 
1G51 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
21i>l 
2201 
2251 
2301 
2351 
2401 
2451 
/»i)01 
2S51 
2601 

27ni 
2751 
2801 
2ttbl 
2901 
2951 
3001 
30 'j1 
HlOl 
3151 



TCGAGTTTAC 
ACCACTCCCT 
CTATCAGTCA 
RTAGAGAAAA 
AAGTGAAAGi: 
TCGAGTTTAC 
GGTACCCGGG 
CTCGTTTAGT 
GACCTCCATA 
attcaccggt 
caatgtttca 
acagagctqc 
caagcaacag 
tatgcatagt 
ttaaagtttt 
gtatgaaaca 
taattaggtg 
agacatctgg 
cggtcyatgt 
agctgtaatc 
atttqcaacc 
agctcagagg 
ggacagagcc 
cgcttcggtt 
gacctgttaa 
accataai-cr, 
AITGCATTCA 
TAAAGCAAGT 
AAGCCTCGTC 
CGCGCTCCAT 
TCCAAGGGCA 
TAAATCCCGC 
TCGTCTAGCG 
GAGCTCGTCC 
GCGTGTGTCC 
CTTCGGGGGG 
CCGTAATTGT 
TTTAAGTTGT 

aggctggctc 
aatggcggca 

tTGATGCTCT 
CGCtGAGTGC 
AAGGCTAATT 



ACCTAAATG'r 
AAC?TTTAGC 
GGGCAAAAGT 
GAGCAAAGCC 
CACCTAGCTT 
TCATTAAGCA 
AGACATGGTG 
TCACTACTTC 
AAATAAAAAA 
GGAGTTAGGG 
CTAATTGAGA 
ACTTTCCACA 
CTGCCTGCTG 
CCACAGGTCG 
TGGGGTCGTG 
I'CAGGCACCG 
ACTCGACGTC 
TTGCGGGCCG 
CGCCTCCACT 
GGTCGGGCGA 
CGGTGCGGCG 



CACTCCCTAT 
ATCAGTGATA 
TAGAGAAAGT 
GTGAAAGTCG 
CGAGTTTACC 
CACTCCCTAT 
TCGAGTAGGC 
GAACCGTCAG 
6AAGACACCG 
tagt.ataaaa 
ggacccacag 
aaacaactat 
rtactgcgac 
atatagagat. 
at tctaaaat 
auattagaac 
tattaactgt 
^icaaaaagca 
a^ytcttgtt 
atgcatggag 
aqagacaact 
nggaggatga 
cattacaata 
gtgcgtacaa 
tgggcacact 
accatggctg 
ttttatgttt 
aaaacctcta 
gtctggccgg 
gagcagagcg 

TCGGTAAACA 
ACCCGGGGAA 
CGTCGGCATG 
CCCAGGCTGA 
GCGGGGAGAA 
GTCGTCGTCC 
TTTTCGTACG 
TTTTCTAATC 
TOCACCtTGO 
TACTATCAGT 
TGATCTTCCA 
ATATAATGCA 
GATTTTCGAG 
ACTTTTGCTC 
GTTATTACGT 
GAGTArCGTG 
CGCTTATTTT 
CTGGCCGAGT 
GCTCTAATGC 
GAAGCTTTTT 
TGGAATAGCT 
AATTAGTCAG 
GCGGGATGGG 
TGCAtGCTTT 

cctggttgct 
gggagcctgg 

ACTAGATCGA 
CGCTCCTTTC 
GGCTTGCGGG 
GGCGGTGACG 
CGGAGGTCTC 
CCGGGGAGCA 
GACGCCGACG 
CCAGGAGGCC 



CAGTGATAGA 
GAGAAAAGTG 
GAAAGTCGAG 
AGTTTACCAC 
ACTCCCTATC 
CAGTGATAGA 
GTGTACGGTG 
ATCGCCTGGA 
GGACCGATCC 
gcag&cattt; 
gagogaccca 
acatgatata 
gtgaggtata 
gggaatccat 
tagtgagtat 
agca^tacaa 
caaaagccac 
aagattccat 
gcagatcatc 
atacacctac 
gatctcract. 
aatagatggr 
ctgtaacctt 
agcacacacg 
aggaattgtg 
atcctgcagg 
CAGGTTCAGG 
CAAATGTGGT 
ACCACGCTAT 
TCGCGCCCCC 
GAGCGCCGTA 
TCCCCGTCCC 
CGCCATCGCC 
CATCGGTCGG 
AGGACAGGCG 
GGGAGATCGA 
CGCGCGGCTG 
CGCATATGAT 
TGATCAAATA 
AGTAGGTGTT 
ATACGCAACC 
TTCTCTAGTG 
AGTTTCATAC 
CATCGCGATG 
AAAAAATCT7 
CCTATCTAAC 
TTACATGCCA 
TTACGGGTTG 
GCTGTTAATC 
GCAAAAGCCT 
CAGAGGCCGA 
CCATGGGGCG 
CGGAGTTAGG 
GCATACTTCT 
GACTAATTGA 
GGACTTTCCA 
ATTCTCAATT 
GGTCGGOCGC 
TCAtGCACCA 
GTGAAGCCGA 
CAGGAAGGCG 
CGACGGCGCT 
GTGGCCAGGA 
TTCCATCTGT 



GAAAAGTGAA 
AAAOTCGAGT 
TTTACCACTC 
TCCCTATCAG 
AGTGATAGAG 
GAAAAGTGAA 
GGAG5CCTAT 
GACGCCATCC 
AGCCTgcggc 
tatgcaccaa 
gaaagtracc 
atattagaat 
tgactttgct 
augctgtatg 
agacattatt 
caaaccgtcg 
tatgtcctga 
aatataaggg 
aagaacacgt 
attgcatgaa 
gttatgagca 
ccagctggac 
ttgttgcaag 
tagacattcg 
tgccccatct 
atCCCCCGGG 
GGGAGGTGTG 
ATGGCTGATT 
CTGTGCAAGG 
TACCCACCGT 
GGGGGCGGAG 
CCAACATGTC 
ACGTCCTCGC 
GGGGGCCGTC 
CGGAGCCGCC 
GCAGGCCCTC 
TACGCGGACC 
CAATTCAAGG 
ATTCGATAGC 
TCCCTTTCTT 
TAAAGTAAAA 
AAAAACCTTG 
TGTTTTTCTG 
ACTTAGTAAA 

gccagctttc 
atctcaatgg 
atacaatgta 
ttaaaccttc 
actttacttt 

AGGCCTCCAA 
GGCGGCCTCG 
GAGAATGGGC 
GGCGGGACTA 
GCCTGCTGGG 
GATGCATGCT 
CACCCTAACT 
GTTTTACGCG 
TGCGGGTCGT 
GGTCGCGCGG 
GCCGCTCGTA 
GGCACCCCGC 
GCCCAGACCC 
ACCACGCGGG 
TGCTGCGCGG 



AGTCGAGTTT 
TTACCACTCC 
CCTATCAGTG 
TGATAGAGAA 
AAAAGTGAAG 

agtcgagctc 
ataagcagag 
acgctgtttt 
cgcagatcta 
aagagaactg 
acagttatgc 
gtgtgtactg 
tttcgggatt 
tgataaatgt 
gttatagttt 
tgtgatttgt 
agaaaagcaa 
gtcggtggac 
agagaaaccc 
tatatgttag 
attaaatgac 
aagcagaacc 
tgtgactcta 
tactttggaa 
gttctcagaa 
AACAACAACA 

ggaggttttt 
atgatcctgc 

TCCCCGGACG 

actcgtcaat 
tcgtgggggg 
cagatcgaaa 
cgtataagtg 
gacagtctgc 
agccccgcct 
gatggtagac 
cactttcaca 
ccgaataaga 
ttgtcgtaat 
ctttagcgac 

TGCCCCACAG 

ttggcataaa 
taggccgtgt 

GCACATCTAA 

cccttctaaa 

CTAAGGCGTC 
GGCTGCTCTA 
GATTCCGACC 
TATCtAATCT 
AAAAGCCTCC 
GCCTCTGCAT 
GGAACTGGGC 
TGGTTGCTGA 
GAGCCTGGGG 
TTGCATACTT 
GACACACATT 
GCCCGATGCA 
GGGGCGGGCG 
TCCTTCGGGC 
GAAGGGGAGG 
CGCGCTCGGC 
TTGCCCTGGT 
CTCCTTGGGC 
CCAGCCGGGA 
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3201 ACCGCTCAAC TCOGCCATGC GCGGGCCGAT CTCGGCGAAC ACCGCCCCCG 
CTTCGACGCT CTCCGGCGTG GTCCAGACCG CCACCGCGGC GCCGTCGTCC 

330 1 GCGACCCACA CCTTGCCGAT GTCGAGCCCG ACGCGCGTGA GGAAGAGTTC 

3351 TTGCAGCTCG GTGACCCGCT CGATGTGGCG GTCCGGATCG ACGGTGTGGC 

.■14 01 GCGTGGCGGG GTAGTCGGCG AACGCGGCGG CGAGGGTGCG tACGGCCCTG 

34 51 GGGACGTCGT CGCGGGTGGC GAGGCGCACC GTGGGCTTGT ACTCGGTCAT 

3501 GGTAAGCTGA TCCGGCCGGC GCCTAGAGAA GGAGTGAGGG CTGGATAAAG 

3551 GGAGGATTGA GGCGGGGrCG AAAGAGGAGG TTCAAGGGGG AGAGACGGCG 

3601 CGGATGGAAG AAGAGGAGGC GGAGGCTTAG GGTGTACAAA GGGCTTGACC 

3651 CAGGGAGGGG GGTCAAAAGC CAAGGCTTCC CAGGTCACGA TGTAGGGGAC 

3701 CTGGTCTGGG TGTCCATGCG GGCCAGGTGA AAAGACCTTG ATCTTAACCT 

3751 GGGTGATGAC GTCTCGGTTA AAGGTGCCGT CTCGCGGCCA TCCGACGTTA 

3801 AAGGTTOGCC ATTCTGCAGA GCAGAAGGTA ACCCAACGTC TCTTCTTGAC 

3851 ATCTACCGAC TGGTTGTGAG CGAGCCGCTC GACATCTTTC CAGTGATCTA 

3901 AGGTCAAACT TAAGGGAGTG GTAACAGTCT GGCCCTAATT TTCAGACAAA 

3 951 TACAGAAACA CAGTCAGACA GAGACAACAC AGAACGATGC TGCAGCAGAC 

4 001 AAGACGCGCG GCTTCGGTTC CAAJ^CCGAAA GCAAAAATTC AGACGGAGGC 
4 051 GGGAACTGTT TTAGGTTCTC GTCTCCTACC AGAACCACAT ATCCTGACGG 
4101 GGTCOGATTC CACATCGACT CCCTTCCTCA GGTCGGGCCA CAAAAACGGC 
4151 CCCCAAAGTC CCTGGGACGT CTCCCAGGGT VGCGGCCGGG TGTTCAGAAC 
4 201 TCGTCAGTTC CACCACGGGT CCGCCAGA7A CAGAGCTAGT TAGCTAACTA 
4 251 GTACCGACGC AGGCGCATAA AATCAGTCAT AGACACTAGA CAATCGGACA 
4 301 GACACAGATA AGTTGCTGGC CAGCTTACCT CCCGGTGGTG GGTCGGTGGT 
4 351 CCCTGGGCAG GGGTCTCCCG ATCCCGGACG AGCCCCCAAA TGAAAGACCC 
4 401 CCGCTGACGG GTAGTCAATC ACTCAGAGGA GACCCTCCCA AGGAACA6CG 
4 451 AGACCACAAG TCGGATGCAA CTGCAAGAGG GTTTATTGGA TACACGGGTA 
4bOl CCCGGGCGAC TCAGTCAATC GGAGGACTGG CGCCCCGAGT GAGGGGTTGT 
4 551 GGGCTCTTTT ATTGAGCTCG GGGAGCAGAA GCGCGCGAAC AGAAGCGAGA 
4 601 AGCGAACTGA TTGGTTAGTT CAAATAAGGC ACAGGGTCAT TTCAGGTCCT 
4 651 TGGGGCACCC TGGAAACATC tGATGGTTCT CTAGAAACTG CTGAGGGCTG 
4701 GACCGCATCT GGGGACCATC TGTTCTTGGC CCTGAGCCGG GGCAGGAACT 
4 751 GCTTACCACA GATATCCTGT TTGGCCCATA TTCAGCTGTT CCATCTGTTC 
4 901 TTGGCCCTGA GCCGGGGCAG GAACTGCTTA CCACAGATAT CCTGTTTGGC 
4 8S1 CCATATTCAG GCTGCAGGTG GCACTTTTCG GGGAAATGTG CGCGGAACCC 
4 901 CTATTTGTTT ATTTTTCTAA ATACATTCAA ATATGTATCC GCTCATGAGA 
4 951 CAATAACCCT GATAAATGCT TCAATAATAT TGAAAAAGGA AGAGTATGAG 
5001 TATTCAACAT TTCCGTGTCG CCCTTATTCC CTTTTTTGCG GCATTTTGCC 
5051 TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA AGATGCTGAA 
5101 GATCAGTTGG OTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG 
5151 TAAGATCCTT GAGAGTTTTC GCCCCGAAGA ACCrXTTCCA ATGATGAGCA 
5201 CTTTTAAAGT TCTGCTATGT GGCGCGGTAT TATCCCGTGT TGACGCCGGG 
5251 CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAftTG ACTTGGTTGA 
5301 GTACTCACCA GTCACAGAAA AGCATCTTAC GGATGGCATG ACAOTAAGAG 
5351 AATTATGCAG TGCTGCCATA ACCATGAGTG ATAACACTGC GGCCAACTTA 
5401 CTTCTGACAA CGATCG6AGG ACCGAAGGAG CTAACCGCTT TTTTGCACAA 
54 51 CATGGGGGAT CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG 
bl301 AAGCCATACC AAACGACGAG CGTGACACCA CGATGCCTGT AGCAATGGCA 
5551 ACAACGTTGC GCAAACTATT AACTGGCGAA CTACTrACTC TAGCT7CCCG 
5601 GCAACAATTA ATAGACTGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC 
5651 TQCGCTCGGC CCTTCCGGCT GGCTGGTTTA TTGCTGATAA ATCTGGAGCC 
5701 GGTGAGCGTG GGTCTCGCGG TATCATTGCA GCACTGGGGC CAGATQGTAA 
5751 GCCCTCCCGT ATCGTAGTTA TCTACACGAC GGGGAGTCAG GCAACTATGG 
5901 ATGAACGAAA TAGACAGATC GCTGAGATAG GTGCCTCACT GATTAAGCAT 
5951 TGCTAACTGIT CAGACCAAGT TTACTCATAT ATACTTTAGA TTGATTTGCG 
rj90l GCCGGCCGCA AACTTCATTT TTAATTTAAA AGGATCTAGG TGAAGATCCT 
5S51 TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTtCCACT 
6001 GAGCGTCAGA CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT 
6051 TTTCTGCGCG TAATCTGCTG CTTGCAAACA AAAAAACCAC CGCTACCAGC 
6101 GGTGGTTTGT TTGCCGGATC AAGAGCTACC AACTCTTTTT CCGAAGGTAA 
6151 CTGGCTTCAG CAGAGCGCAG ATACCAAATA CTGTCCTTCT AGTGTAGCCG 
6201 TAGTTAGCCC ACCACTTCAA GAACTCTGTA GCACCGCCTA CATACCTCGC 
6251 TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC 
6301 TTACCGGGTT GGACTCAAGA CGATAGTTAC CGGATAAGGC GCAGCGGTCG 
6351 GGCTGAACGG GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA 
64 01 CACCGAACTG ACATACCTAC AGOGTCAGCT ATGAGAAAGC GCCACGCTTC 
64 51 CCGAAGGGAG AAAGGCGGAC AOGTATCCGG TAAGCGGCAG GGTCGGAACA 
6501 GGAGAGCGCA CGAGGGAGCT TCCAGGGGGA AACGCCTGGT ATCTTTATAG 
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6551 TCCTGTCGGC5 TTTCGCCACC 

6501 CGTCAGGGGG GCGGAGCCTA 

66bl CGGTTCCTGG CCTTTtGCTG 

6"? 01 ATCCCCTGAT TCTGTGGATA 

6751 CCGCTCGCCG CAGCCGAACG 

680; GCGGAAGAGC GCCAATACGC 

68 51 CATTAATGCA ACTATGGCCA 

6901 AATTAATTTT GATACATGCT 

<?95I AATCACTCAG AGGAGACCCT 

7001 GAAAGACCCC CQCTGACGGG 

7051 GGAACAGCGA GACCACAAGT 

7101 ACACGGGTAC CCGGGCGACT 

7] 51 AGGGGTTGTG GGCTCTTTTA 

7201 GAAGCGAGAA GCGAACTGAI 

/2 51 TCAGGTCCTT GGGGCACCCT 

7 301 TGAGGGCTGG ACCGCATCTG 

7351 GCAGGAACTG CTTACCACAG 

7<101 CATCTGTTCT TGGCCCTGAG 

7 4 Si CTGTTTGGCC CATATTCAGC 

7b 0.1 CTTCTCTATT CTCAGTTATG 

7 551 CTTAAGCTAG CAGATCTGCT 

7 601 ATTCCCCCCT TTTTCTGGAG 

/65X ATTTCCCCGA AAAGTGCCAC 

7 701 CATTAACCTA TAAAAATAGG 

7751 TCCCCGAAAA GTGCCACCTG 

7 801 TAACCTATAA AAATAGGCGT 



tctgacttga gcgtcgattt TTGTCATGCT 
tggaaaaacg ccaocaacgc ggcctttl'ta 
gccttttgct cacatgttct ttcctgcgtt 
accgtattac cgcctttgag tgagctgata 
accgagcgca gcgagtcagt gagcgaggaa 

AAACCGCCTC TCCCCGCGCG TTGGCCGATT 
TTTAATGTAA ATACTTAAGA AAAAAAACCA 
GCATGTGAAG ACCCCCGCTG ACGGGTAGTC 
CCCAAGGCAG CGAGACCACA AGTCGGAAAT 
TAGTCAATCA CTCAGAGGAG ACCCTCCCAA 
CGGATGCAAC TGCAAGAGGG TTTATTGGAT 
CAGTCAATCG GAGGACTGGC GCCCCGAGIG 
TTGAGCTCGG GGAGCAGAAG CGCGCGAACA 
rCGTTAGTTC AflATAAGGCA CAGGGTCATT 
CGAAACATCT GATGGTTCTC TAGAAACTGC 
GGGACCATCT GTtCTTGGCC CTGAGCCGGG 
ATATCCTGTT TGGCCCATAT TCAGCTGTTC 
CCGGGGCAGG AACTGCTTAC CACAGATATC 
TGTTCCATCT GTTCCTGACC TTGATCTGAA 
TATTTTTCCA TGCCTTGCAA AATGGCGTTA 
AGCTTGCCAA ACCTACAGGT GGGGTCTTTC 
ACTAAATAAA ATCTTITATT TTATGCGCAC 
CTGACGTCTA AGAAACCftTT ATTATCATGA 
CGTATCACGA GGCCCTTTCG TCCGCACATT 
ACGrCTAAGA AACCATTATT ATCATGACAT 
ATCACGAGGC CCTTTCGTCC 
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1 
51 
101 
151 

:^oi 

251 
301 
351 
401 
451 
501 
fi5l 
601 
6^1 
701 
7bl 
801 
651 
901 
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1101 
1151 
120} 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
17hl 
1801 
1851 
1901 
1 551 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
23ijl 
2<01 
2451 
2501 
2551 
2601 
265] 
2701 
2751 
2801 
2851 
2901 
29f,l 
3001 
3051 
3101 
3151 
37.01 



TCGAGTTTAC 
ACCACTCCCT 
CTATCAGTGA 
ATAGAGAAAA 
AAGTGAAAGT 
TCGAGTTTAC 
GGTACCCGCG 
CTCGTTTAGT 
GACCTCCATA 
AGTTTAAACG 
actgctgact 
aqaccccaag 
tgtttagtaa 
cldagc^gcac 
tataagcagg 
cacacaggcd 
acctttagct 
tagtgccttg 
gtgggttaaa 
caagcgtcct 
Lgatgtgttg 
aaatgtgttt 
Catgaaaagc 
aaaaaccata 
tTigatagcct 
gatcttttgq 
catagaagaa 
aaatggattc 
cctaaaaaaa 
t acattagca 
ttaat ttgcc 
cagtttttag 
agatttgcct 
atttggargg 
agaacroaaa 
gcctaaaaca 
aagattattt 
agaataattc 
acctgtggct 
aagagagatt 
ttrtaatgtgg 
tgacgai gat 
gggagaagaa 
tcccaaggct 
taatcagcca 
ccaoacctcc 
GGAACAACAA 
TGGGAGGTTT 
TTATGATCCT 
GGTCCCCGGA 
GTACTCGTCA 
AGTCGTGGGG 
TCCAGATCGA 
GCCGTATAAG 
TCGACACTCT 
CCAGCCCCGC 
TCGATGGTAG 
CCCACTTTCA 
GGCCGAATAA 
GCTTGTCGTA 
TTCTTTAGCG 
AATGCCCCAC 
TGTTGGCATA 
TGTAGGCCGT 
AAGCACATCr 



CACTCCCTAT 
ATCAGTGATA 
TAGAGAAAGT 
GTGAAAGTCG 
CGAGTTTACC 
CACTCCCTAT 
TCGAGTAGGC 
GAACCGTCAG 
GAAGACACCG 
GATCCxxxxx 
ctcaacattc 
gactttcctt 
tagaactctt 
tgcfcatacaa 
cataacagtt 
tagagtgtcc 
ttttaatttg 
actagagatc 
ggagcatga^ 
ggaagcttgt 
ttattgcttg 
aaaatgtatt 
attatgcaaa 
tqccaacagg 
acaattaact 
acaggatgga 
tggatggctg 
aqtqgtgtat 
gatactggcL 
gctgctttgc 
cttggacagg 
tagtrtttga 
tcaggtcagg 
cagtgttiaag 
tatttccccc 
ctgcaggcca 
aaagcattgc 
aaagtggcat 
gagtttgctc 
ggacaaagag 
ctatgggaat 
gaagacagcc 
cacggaagac 
catttcaggc 
taccaeattt 
ccctgaacet 
CAATTGCATT 
TTTAAAGCAA 
GCAAGCCTCG 
CGCGCCCTCC 
ATTCCAAGGG 
GGTAAATCCC 
AATCGTCTAG 
TGGAGCTCGT 
GCGCGTGTGT 
CTCTT CGGGG 
ACCCGTAATT 
CATTTAAGTT 
GAAGGCTGGC 
ATAATGGCGG 
ACTTGATGCT 
AGCGCTGAGT 
AAAAGGCTAA 
GTACCTAAAT 
AAAACTTTTA 



CAGTGATAGA 
GAGAAAAGTG 
GAAAGtCGAG 
AGTTTACCAC 
ACTCCCTATC 
CAGTGATAGA 
GTGTACGGTG 
ATCGCCTGGA 
GGACCGATCC 
xxxxxxargc 
tactcctcca 
cagaattgct 
gcttgctttg 
gaaaattatg 
ataatcataa 
gctattaata 
taaaggggtt 
cattttctgt 
tttaatccag 
aacagagtat 
ggatgcactt 
aaaaaagaac 
tgctgctata 
ctgtrgatac 
agagaacaaa 
tataatgttr 
gagttgcttg 
gactttttaa 
gcttaaagga 
ttgaattatg 
ctgaactttg 
ggatgtaaag 
gaartaataa 
gtaaacttag 
tggaatagtc 
gatttgtaaa 
ctggaacgca 
cgctttgctt 
aaagtattca 
tttagtttgt 
tggagtttta 
aggaaaatgc 
tcagggcatg 
ccct cagtcc 
gtagaggttt 
gaaacataax 
CATTTTATGT 
GTAAAACCTC 
TCGTCTGGCC 
ATGAGCAGAG 
CATCGGTAAA 
GGACCCGGGG 
CGCGTCGGCA 
CCCCCAGGCT 
CCGCGGGGAG 
GCGTCGTCGT 
GTTTTTCGTA 
GTriTTCTAA 
TCTGCACCTT 
CATACTATCA 
CTTGATCTTC 
GCATATAATG 
TTGATTTTCG 
GTACTTTTGC 
GCGTTATTAC 



GAAAAGTGAA AGTC6AGTTT 
AAAGTCGAGT TTACCACTCC 
TTTACCACTC CCTATCAGTG 
TCCCTATCAG TGATAGAGAA 
AGTGATAGAG AAAAGTGAAG 
GAAAAGTGAA AGTCGAGCTC 
GGAGGCCTAT ATAAGCAGAG 
GACGCCATCC ACGCTGTTTT 
AGCCTGCGGC CGCTTAATTA 
catctagtga tgatgaggct 
aaaaagaaga gaaaggtaga 
aagttttttg agtcatgctg 
ctatttacac cacaaaggaa 
gaaaaatatt ctgtaacctt 
catactgttt tttcttactc 
actatgctca aaaattgtgt 
aataaggaat atttgatgta 
tattgaggaa agtttgecag 
aagaagcaga ggaaacta^a 
gcaatggaaa caaaatgcga 
ggaatttcag tacagttttg 
agcccagcca ctataagtac 
ttcgctgaca gcaaaaacca 
tgttttagcfc aaaaagcggg 
tgttaacaaa cagatctaat 
ggttctacag gctctgctga 
gctacactgt ttgttgccca 
aatgcatggt gtacaacatt 
ccaattgata gtggtaaaac 
cggggggaaa gctttaaatg 
agctaggagt agctattgac 
ggcactggag gggagtccag 
cctggacaac ttaagggatt 
aaaagaaaca cctaaataaa 
accacgaatg agtacagrgt 
acaaatagat tttaggccca 
gtgagttttt gttagaaaag 
cttatgttaa tttggtacag 
qagcagaatt gtggagtgga 
cagtgtatca aaaaatgaag 
gattggctaa gaaacagtga 
tgataaaaat gaagatggtg 
aaacaggcat tgattcacag 
tcacagtctg ttcatgatca 



tacttgcttt aaaaaacctc 
xxxxxxxxxx ggmtacCCCG 
TTCAGGTTCA GGGGGAGGTG 
TACAAATGTG GTATGGCTGA 
GGACCACGCT ATCTGTGCAA 
CGTCGCGCCC CCTACCCACC 
CAGAGCGCCG TAGGGGGCGG 
AATCCCCGTC CCCCAACATG 
TGCGCCATCG CCACGTCCTC 
GACATCGGTC GGGCGG6CCG 
AAAGGACAGG CGCGGAGCCG 
CCGGGAGATC GAGCAGGCCC 
CGCGCGCGGC TGTACGCGGA 
TCCGCATATG ATCAATTCAA 
GGTGATCAAA TAATTCGATA 
GTAGTAGGTG TTTCCCTTTC 
CAATACGCAA CCTAAAOTAA 
CATTCTCTAG TGAAAAACCT 
AGAGTTTCAT ACTGTTTTTC 
TCCATCGCGA TGACTTAGTA 
GTAAAAAATC TTGCCAGCTT 
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32 S] TCCCCTTCTA AAGGGCAAAA GT(3AGTATGG TGCCTATCTA ACATCTCAAT 

3 301 GGCTAAGGCG TCGAGCAAAG CCCGCTTATT TTTTACATGC CAATACAATG 

3*^bl TAGGCTCCTC TACACCTAGC TTCtGGGCGA GTTTACGGGT TGTTAAACCT 

3>ini TCGATTCCGA CCTCATTAAO CAGCTCTAAT GCGCTGTTAA TCACTTTACT 

34 51 TTTATCTAAT CTAGACATGG TGGAAGCTTT TTGCAAAAGC CTAGGCCTCC 

hbOl AAAAAAGCCT CCTCACTACT TCTGGAATAG CTCAGAGGCC GAGGCGGCCT 

3S=il CGGCCTCTGC ATAAATAAAA AAAATTAGTC AGCCATGGGG CGGAGAATGG 

3601 GCGGAACTGG GCGGAGTTAG GGGCGGGATG GGCGGAGTTA GGGGCGGGAC 

3651 TATGGTTGCT GACTAATTGA GATGCATGCT TTGCATACTT CTGCCTGCTG 

3701 GGGAGCCTGG GGACTTTCCA CACCTGCTTG CTGACTAATT CAGATGCATG 

'■4751 CTTTGCATAC TTCTGCCTGC TGGGGAGCCT GGGGACTTTC CACACCCTAA 

3901 CTGACACACA TTCCACAGGT CGACTAGATC GAATTCTCAA TTGTTTTACG 

3951 CGGCCCGATG CATGGGCTCG TGCGCTCCTT TCGGTCGGGC GCTGCGGGTC 

3 901 GTGGGGCGGG CGTCAGGCAC CGGGCTTGCG GGTCATGCAC CAGGTCGCGC 
3951 GGTCCTTCGG GCACTCGACG TCGGCGGTGA CGGTGAAGCC GAGCCGCTCG 

4 001 TAGAAGGGGA GGTTGCGGGG CGCGGAGGTC TCCAGGAAGG CGGGCACCCC 
4 051 GGCGCGCTCG GCCGCCTCCA ctccggggag cacgacggcc ctgcccagac 
4101 CCTTGCCCTG GTGGTCGGGC GAGACGCCGA CGGTGGCCAG GAACCACGCG 
4151 GGCTCCTTGG GCCGGTGCGG CGCCAGGAGG CCTTCCATCT GTTGCTGCGC 
1?.0l GGCCAGCCGG GAACCGCTCA ACTCGGCCAT GCGCGGGCCG ATCTCGGCGA 
42 51 ACACCGCCCC CGCTTCGACG CTCTCCGGCG TGGTCCAGAC CGCCACCGCG 
4 301 GCGCCGtCGT CCGCGACCCA CACCTTGCCG ATGTCGAGCC CCACGCGCGT 
4 351 GAGGAAGAGT TCTTGCAGCT CGGTGACCCG CTCGATGTGG CGGTCCGGAT 
4 401 CGACGGTGTG GCGCGTGGCG GGGTAGTCGG CGaACGCGGC GGCGAGGGTG 
44 51 CGTACGGCCC TGGGGACGTC GTCGCGGGTG GCGAGGCGCA CCGTGGGCTT 
4 501 GTACTCGGTC ATGGTAAGCT GATCCGGCCG GCGCCTAGAG AAGGAGTGAG 
4551 GGCTGGATAA AGGGAGGATT GAGGCGGGGT CGAAAGAGGA ggttcaaggg 

4 601 ggagagacgg cgcggatgga agaagaggag gcggaggctt agggtgtaca 
4 6bl aagggcttga cccagggagg ggggtcaaaa gccaaggctt cccaggtcac 
4 701 gatgtagggg acctggtctg ggtgtccatg cgggccaggt gaaaagacct 

4 7 51 TGATCTTAAC CTGGGTGATG AGGTCTCGGT TAAAGGTGCC GTCtCGCGGC 

4 001 CAtCCGACGT TAAAGGTTGG CCATTCTGCA GAGCAGAAGG TAACCCAACG 

4 851 TCTCTTCtTG ACATCTACCCi ACTGGTTGTG AGCGAGCCGC TCGACATCTT 

4 901 TCCAGTGATC TAAGGTCAAA CTTAAGGGAG TGGTAACAGT CTGGCCCTAA 

4 951 TTTTCAGACA AATACAGAAA CACAGO'CAGA CAGAGACAAC acagaacgat 

5001 gctgcagcag acaagacgcg cggcttcggt tccaaaccga aagcaaaaat 

5051 tcagacggag gcgggaactg ttttaggttc tcgtctccta ccagaaccac 

i>101 atatcctgac ggggtcggat tccacatcga ctcccttcct caggtcgggc 

5151 cacaaaaacg gcccccaaag tccctgggac gtctcccagg gttgcggcicg 

5201 ggtgttcaga actcctcagt tccaccacgg gtccgccaga tacagagcta 
gttagctaac tagtaccgac gcaggcgcat aaaatcagtc atagacacta 

5301 gacaatcgga cagacacaga taagttgctg gccagcttac ctcccggtgg 

b3 51 tgggtcggtg gtccctgggc aggggtctcc cgatcccgga cgagccccca 

h4 01 aatgaaagac ccccgctgac gggtagtcaa tcactcagag gagaccctcc 

54 51 CAAGGAACAG CGAGACCACA AGTCGGATGC AACTGCAAGA GGGTTTATTG 

'J 5 01 GATACACGGG TACCCGGGCG ACTCAGTCAA TCGGAGGACT GGCGCCCCGA 

5551 GTGAGGGGTT GTGGGCTCTT TTATTGAGCT CGGGGAGCAG AAGCGCGCGA 

S601 ACAGAAGCGA GAAGCGAACT GATTGGTTAG TICAAATAAG GCACAGGGTC 

5651 ATTTCAGGTC CTTGGGGCAC CCTGGAAACA TCTGATGGTT CTCTAGAAAC 

b7 01 tGCTGAGGGC TGGACCGCAT CTGGGGACCA TCTGTTCTTG GCCCTGAGCC 

57 51 GGGGCAGGAA CTGCTTACCA CAGATATCCT GTtTGGCCCA TATTCAGCTG 

5801 TTCCATCTGT TCTTGGCCCT GAGCCGGGGC AGGAACTGCT TACCAQAGAT 

5S51 AtCCTGTTTG GCCCATATTC AGGCTGCAGG TGGCACTTTT CGGGGAAATG 

S901 TGCGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT 

G951 CCCCTCATGA GACAATAACC CTGATAAATG CTTCAAtAAT ATTGAAAAAG 

6001 GAAGAGTATG AGTA'JTCAAC ATTTCCGTGT CGCCCTTATT CCCTTTTTTG 

6051 CGGCATTTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA 

6101 AAAGATGCTG AAGATCAGTT GGGTGCACGA GTGGGTTACA TCGAACTGGA 

6151 TCTCAACAGC GGTAAGATCC TTGAGAGTTT TCGCCCCGAA GAACGTTTTC 

6201 CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT ATTATCCCGT 

6251 GTTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA 

6301 TGACTTGGTT GAGTACTCAC CAGTCACAGA AAAGCATCTT ACGGATGGCA 

^>3hl TGACAGTAAG AGAATTATGC AGTGCTGCCA TAACCATGAG TGATAACACT 

6 401 GCGGCCAACT TACTTCTGAC AACGATCGGA GGACCGAAGC AGCTAACCGC 

6451 I'TTTTTGCAC AACATGGGGG ATCATGTAAC TCGCCTTGAT CGTTGGGAAC 

650 i CGGAGCTGAA TGAAGCCATA CCAAACGACG AGCGTGACAC CACGATGCCT 

6551 CTAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC 
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6601 TCTA6CTTCC CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG 

G651 CAGGACCACT TCTGCGCTCG GCCCTTCCGG CTGGCTGGTT TATTGCTGAT 

6701 AAATCTGGAG CCGGTGAGCG TGGGTCTCGC GGTATCATTG CAGCACTGGG 

6751 GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG ACGGGGAGTC 

6801 AGGCAACTAT GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA 

6851 CTGATTAAGC ATTCGtAACT GTCAGACCAA GTTTACTCAT ATATACTTTA 

5901 GATTGATTTG CGGCCGGCCG CAAACTTCAT TTTTAATTTA AAAGGATCTA 

6951 GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT 

7001 TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT 

7031 TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 

7101 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 

7151 TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 

7201 CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC 

/251 TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG 

7 301 ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG 

7351 GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 

?401 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 

lAbl GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC 

7 501 AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 

7 551 GTATCTTTAT AGTCCTGTOG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT 

7 601 TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC 

7 651 GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTtTTG CTCACATGTT 

7701 CTTTCCTGCG TTATCCXTCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 

77'Jl AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA 

7801 GTGAGCGAGG AAGCGGAAGA GCGCCAAfAC GCAAACCGCC TCTCCCCGCG 

7851 CGTTGGCCGA TTCATTAATG CAACTATGGC CATTTAATGT AAATACTTAA 

7 901 GAAAAAAAAC CAAATTAATT TTGATACATG CTGCATGTGA AGACCCCCGC 
7951 TGACGGGTAG TCAATCACTC AGAGGAGACC CTCCCAAGGC AGCGAGACCA 
8001 CAAGTCGGAA ATGAAAGACC CCCGCTGACG GGTAGTCAAT CACTCAGAGG 
8051 AGACCCtCCC AAGGAACAGC GAGACCACAA GTCGGATGCA ACTGCAAGAG 
6101 GGTTXATTGG ATACACGGGT ACCCGGGCGA CTCAGTCAAT CGGAGGACTG 
8151 GCGCCCCGAG TGAGGGGTTG TGGGCTCTTT TATTGAGCTC GGGGAGCACA 
8201 AGCGCGCGAA CAGAAGCGAG AAGCGAACTG ATTGGTTAGT TCAAATAAGG 
8251 CACAGGGTCA TTTCAGGTCC TTGGGGCACC CTGGAAACAT CTGATGGTTC 

8 301 TCTAGAAACT GCTGAGGGCT GGACCGCATC TGGGGACCAT CTGTTCTTGG 
^3 51 CCCTGAGCCG GGGCAGGAAC TGCTTACCAC AGATATCCTG TTTGGCCCAT 
8 4 01 ATTCAGCTGT TCCATCTGTT CTTGGCCCTG AGCCGGGGCA GGAACTGCTT 
8451 accacagata tcctgtttgg CCCATATTCA GCTGTTCCAT CTGTTCCTGA 
8501 CCTTGATCTG AACTTCTCTA TTCTCAGTTA TGTATTTTTC CATGCCTTGC 
8551 AAAATGGCGT TACTTAAGCT AGCAGATCTG CTAGCTTGCC AAACCTACAG 
8601 gtcgggtctt TCATTCCCCC CTTTTTCTGG AGACTAAATA aaatctttta 
8651 TTTTATGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA 
3701 TTATTATCAT GACATTAACC TATAAAAATA CGCGTATCAC GAGCCCCTrT 
8751 CGTCCGCACA TTTCCCCGAA AAGTGCCACC TGACGTCTAA GAAACCATTA 
8801 TTATCATGAC ATTAACCTAT AAAAATAGGC GTATCACGAG GCCCTTTCGT 
8851 CC 



Fig. 9 




mmmmmm 



- 28/35 - 



puhdlO-3-HIT.3 Length: 3621 

i ctcgagttta ccactcccta tcagtgatag agaaaagtga aa^tcgagtt 

bl taccactcoc tatcagtgat agagaaaagt gaaagtcgag tttaccactc 

101 Gctatcagtg atagaga aa gtgaaagtcg agtttaccac tccctatcaq 

151 tgatagagaa aaqtgaaagt cgagtttacc actccctatc agtgatagag 

201 aaaagtgaaa gtcgagttta ccactcccta tcagtgatag agaaaagtga 

261 aagtcgagtt taccactcec tatcagtgat agagaaaagt gaaagtcgag 

301 ctcggtaccc gggtcgagta ggcgtgtacg gtgggaggcc tatataagca 

351 gagctcgttt agtgaaccgt cagatCQCct ggagacgcca tccacgctgt 

4 01 tttgacctcc atagaagaca ccgggaccga tccagcctcc gcggccccga 

Abl attaaacagt cgagctacgt caacgaaaaa caaaatccaa acatgagccg 

£01 cctgcccgtc ctgctcctgc tccaactcct ggtccgcccc ggactccaag 

iSSl ctcccatgac ccagacaacg tccttgaaga caagcrgggt taactgctct 

601 aacatgatcg atgaaattat aacacactta aagcagccac ctttgccttt 

651 gctggacttc aacaacctca atggggaaga ccaagacatt ctgatggaaa 

7 01 ataaccttcg aaggccaaac ctggaggcat tcaacagggc tgtcaagagt 

7 51 ttacagaacg catcagcaat tgagagcatt cttaaaaatc tcctgccatg 

801 tctgcccctg Qccacggccg cacccacgcg acatccaatc catatcaagg 

851 acggtgactg gaatgaattc cggaggaaac tgacgttcta tctgaaaacc 

901 cttgagaatg cgcaggctca acagacgact ttgagcctcg cgatctttta 

951 gaactcgact ctagacatga taagatacat tgatgagttt ggacaaacca 

lOOi caactagaat gcagtgaaaa aaatgcttta tttgtgaaat ttgtgatgct 

1051 attgccttat ttgtaaccat tataagctgc aataaacaag ttaacaacaa 

1101 caattgcatt cattttatgt ctcaggttca gggggaggtg tgggaggttt 

1151 cttaaagcaa gtaaaacctc tacaaatgtg gtatggctga ttatgatcct 

1201 gcaagcctcg tcgtctggcc ggaocacgct atctgtgcaa ggtccccgga 

1251 cgcgcgctcc atgagcagag cgcccgccgc cgaggcaaga ctcgggcggc 

1301 gccctgcccg tcccaccagg tcaacaggcg gtaaccggcc tcttcatcgg 

1351 gaatgcgcgc gaccttcagc atcgccggca tgtcccctgg cggacgggaa 

14 01 gtatcagctc gaccaagctt ggcgagattt tcaggagcta aggaagctaa 

1451 aac:g9agaaa aaaatcactg gatataccac cgttgatata tcccaatggc 

1501 atcgtaaaga acattttgag gcatttcagt cagttgctca atgtacctat 

1551 aaccagaccg ttcagctgca ttaatgaatc ggccaacgcg cggggagagg 

1601 cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 

1651 gctcggtcgt tcggctgcgg cgagcggtat cagctcacto aaagtcggta 

1701 atacggttat ccacagaatc agggcrataac gcaggaaaga acatgtgagc 

17 hi aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt 

1801 tttrccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 

IB 51 agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 

1901 ccctggaagc tccctcgtigc gctctcctgt tccyaccctq ccgcttaccg 

1951 gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc 

2001 tcacgcrgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg 

2051 utgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta 

2101 actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggaa 

2lbl gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 

2201 aqagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat 

2251 ttggratctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 

2301 agctcttgat ccggcaaaca aaccaccgct ggtaqcggtq gtttttttgt 

2351 ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 

24 01 cgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa 

24 51 gggattttag tcatgagatt atcaaaaagg atctTicacct agatcctttt 

2501 aaattaaaaa cgaagtttta aatcaatcta aagtatatat gagtaaactt 

2551 ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 

2601 tgtctatttc gtrcatccat agttgcctga ctcccogtog tgtagataac 

2651 tacgatacgg gagggcttac cacctggccc cagtgctgca atgataccgc 

2701 gagacccaeg ctcaccggct ccagatttat cagcaataaa ccagecagcc 

2751 ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca 

2901 gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 

2951 gtttgcgcaa cgttgttgcc attgctacag gcatcgtgtg gtcacgctcg 

2901 tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt 

2 951 tacatgatcc cccatgctgt gcaaaaaagc ggttagctcc ttcggtcctc 

3001 cgatcgttgt oagaagtaag ttggccgcag tgtratcact catggttatg 

3051 gcagcactgc ataattctct tactgrcatg ccatccgtaa gatgcttttc 
3101 tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 
3151 qaccgagttg ctcttgcccg tcgtcaatac gggataatac cgcgccacat 

3201 aqcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa 



it . uc 



3301 
3351 
34C1 
34!j1 
3501 
3551 
3601 



actctcaagg 
gtgcacccaa 
tgagcaaaaa 
acggaaatqt 
1 1 tat caggg 
aaaaataaac 
tgacgrctda 
gtaLcacgag 



atcttaccgc 
ctgatcttca 
caggaaggca 
tgaatactca 
ttatugtctc 
aaataggggt 
gaaaccatta 
gcnctt ccqt 
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tgttgagatc 
gcatctttta 
aaatgccgca 
tactcttcct 
atgagcggat 
tccgcgcaca 
ttatcatgac 
c 



cagttcgatg 
ctttcaccag 
aaaaagg-gaa 
ttttcaatat 
acatatttga 
tttccccgaa 
attaacctat 



taacccactc 
cgtttctqgg 
taagggcgac 
tattgaagca 
atgtatttag 
aagtgccacc 
aaaaataggc 
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31 taccactccc Intcaqtgat agagaaaagt qaaagtcgag tttatcactr- 

101 cctatcagr.g atagagaaaa gtgaaagtcg agtttaccac t-jcctatcag 
tgatagagaa aagtgaaagt cgagtttacc actccctatc agtgatagag 

:;01 ikriaaqtgaaa gtctjagttta ccactcccta tcagtgatag agaaaagtga 

21^1 aagtcgagct taccaocccc tatcagtgat agagaaaagt gaaagtcgag 

301 rtcqytfSCcc qggtc:gaqta ggcgtgcacg gtgggaggcc tatataagca 

'itil yagctcgttt agtgaaccgt cagatcgcct ggagacgcca tccacgctgt 

4 01 tttgacctcc atagaaqaca ccgggaccga tccagcctcc gc:ggtggcgg 

451 ccgctc^aga acLagtggat cccccagctt acctgccatg ccagtacccc 

501 cagqagaaga ttccaaagat gtagccgccc cacacagaca gccactcacc 

55) tcttc;agaac gaattgacaa acaaattcgg tacatcctcg acggcacctc 

601 agccctqaga aaggagacat gtaacaagag taacatgT:gt gaaagcagca 

6bl adgaggcacc ggcagaaaac aacctgaacc ttcr.aaagat ggctgaaaaa 

iir qatggatgc;. tccaatccgg attcaatgag gagactcgcc tggtgaaaat 

-751 catcacti^gt cttvtggagt ttgaggtata cctagagtac ctccaqaaca 

H£U gatttgagaq tagtgaqgaa caagccagag ctgtsccagat gagtacaaaa 

951 gtccfcqatcc agtucctgca gaaaaaggca aagaatctag atgcaataac 

901 cacccctqac ccaaccacaa atgccagcct qctgac:gaag ctgcaggcac 

951 agaaccagtg qctgcaggac atgacaactc atctcattct gcgcagcttt 

1001 ^iaggaqttcc tgcagtccag cctgagggct cttcggcaaa tgtagtaagg 

1051 aT.ccgaattc gagctoggta cccggggatc ctctagagga tccagacatg 

T 101 ataagataca ttg^itgagtt tggacaaacc acaactagaa tgcagtgaaa 

11 'il aaaar^gcttt .ir^ttgtgaaa tttgtgatgc tattgcttta txtgtaacca 

1201 trataagctg caataaacaa gttaacaaca acaattgcat tcattctatg 

1251 ti-tsaqqtrc agqqggaggt gtgggaggtt ttttaaagca agtaaaacct 

l-aOl ci-acnaatql ggtatqgctg attatgatcc tqoaagcctc gtcgtctggc 

1351 cqgaccacgc tacctgtgca aggtccccgg acgcgcgctc catgagcaga 

1401 gcgcccqccq r.cgaggcaag actcgggcgg cgccctgccc gtcccaccag 

1451 gtcaao^?ggc qqtaaccggc ctcttcatcg ggaaxgcgcg cgaccttcag 

IhOl catcgccqqc otgtcccctg gcggacggga agtatcagct cgaccaagct 

155"] tggcqagatt ttcaggagct aaggaagcta aaatggacaa aaaaatcact 

1601 ggaratacoa ncgttqatat atcccaatgg caLcgtaaag aacattttga 

1651 ggcatLCcag tcagttgctc: aatgtaccta taaccagacc gtitcagctgc 

1701 riftaatqaat cgqccaacgc qcggggagag gcggtttgcg tattgggcgc 

17hi tcttccgctt ccTtcgctcac tgactcgctg cgctcggtcg ttcggctgcg 

1801 gcgaqcggta tcagctcact caaagtcgqt aatacgqtta tccacagaat 

18 hi caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc 

1901 agqaaccgta aaaaggccgc gttgctggcg ttttrccata ggctccgccc 

1951 ccctgacgag catuacaaaa atcgacgctc aagtcagagg tggcgaaacc 

2001 cqacaggact araaagatac caggcgtttc cccctggaag ciccctcgtq 

2051 cgctctcctg ttucgaccct gccgcttacc ggatacctgt ccgcctttcr 

^'.lOl cccttcgqqa agcgtqgcgc tttctcaatg ctcacgcrgt aggtatctca 

2151 gttcggtgta qgtt^gttcgc tccaagctgg gctgtgtgca cqaacccccc 

:^201 gctcagcccg accgctgcgc ctt-atccggt aactatcgtc ttgagtccaa 

27'ol cc^ggtaaga cacgacttat cgccactgga agcagccact ggraacagga 

2301 ttaqcsigagc gaggtatqtd ggcggtgcta caqagctctt qaagtggtgg 

2-J&1 ccraactacg gctacactag aaggacagta r.ttggtatct gcgctctgct 

2401 uaagccaqtv accttcggaa aaagagttgq cagctcttga tccggcaaac 

2451 aaacc^accgc tggtagcggx ggtttvtttg tttgcaagca goagattacg 

2501 cgcagaaaaa aaqgatctca agaagatcct ttgatctttt ctacggggtc 

2551 tqacgrtcag T^ggaacgaaa actcacgtta agggattttg gtcatgagat 

2601 tatcaaaaag gar.cttcacc tagatccttt taaattaaaa atgaagtttt 

2651 aaatcadtnt aaagcatata igagtaaaci, tggtctgaca qttaccaatg 

?101 clLaatcaqt gaggcaccta tctcagcgat otgtctattt cgttcarcca 

2751 taqtlgcctq actccccgtc gtgtagataa ctacgatacg ggagggctta 

2a01 rcaTatogc:c: ccagtgctgc aatgataccg cgagacccac gctcar.cggc 
Lcoagatttd tcagcaahaa accagccagc cggaagggcc gagcgcagaa 

2901 gtqgr.cctgc aactttatcc gcctccatcc agtctattaa ttgttgccgq 

2951 gaaqctagag taagfcagttc gccagttaat aqtttgcgca acgttgttgc 

3001 nattgctaca qqcatcgtgt ggtcacgctc gtcgtttggt atggcx^tcat 

•^O'jl I f:agctcr.-09 ttccca^cga ccaaggcqag ttacargacc ccccaigtcg 

:jl01 igcaaaaaag cqgttiigctc ctccggtcct ccgatcgtcg tcagaagtaa 

jl51 :ir.tgqccyc:a gtqttat.cac tcatggttat ggcagcactg cataactctc 
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3201 iiLact^tcat ycuatccota 

3251 accaagtcat tctgagaata 

33n3 gtcgtcaata cgggataata 

3351 tcatcattgg aaaacgctct 

3401 ctgttgagat ccagttcgat 

.^451 agcatcrttt actttcacca 

3501 aaaatgccgc aaaaaaggga 

3S!j1 /itactcttcc tthttcaata 

3601 catgagcqga cacatatttg 

36:>l ttccgcgcac atttccccga 

3701 attatcatga cattaaccta 

3751 tc 
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agatgctttt ctgtgactgg tgagtactca 
gtgtatgcgg cgaccgagtt gctottgccc 
ccgcgccaca tagcaqaact ttaaaagtgc 
tcggggcgaa aactctcaag gaccttaccg 
gtaacccact cgtgcaccca actgatcttc 
gcgcttctgg gtgagcaaaa acaggaaggc 
ataagggcga cacggaaatg ttgaatactc 
ttattgaagc atttatcaqg gttattgtct 
aatgtattta gaaaaataaa caaacagggg 
aaagtgccac ctcjacgtcta aqaaaccatt 
taaaaatagg cgtatcacga ggccctttcg 
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crcgaqttiarcactccccatcagtqatagagaaadgtgaaagtcgagttcaccactc^^ 
i + 

tatcagtgat^gagaaaagtgaaagtcgagt« ^20 



61 



121 
181 
241 

30 

361 



5A 
60 
661 



Qtgaaagtcgagtttaccactccctatcagtgatagagaaaagtgaaagtc^ 

— — + — 4 " + 

actccctatcagtqatagagaaaagtgaaagtcgagtttaccactccctat^^^ 
— -I- — ' 1- ' ^ 

agaaaa<,tgaa9gtcgagtttaccactccctat.cagt<3atagaga«agtgaaagtcgag 

+ + 

ctcggtacccgggtcgagtaggcgtgtacggtgggaggcctataraagcagagctcgttt 

1 + ^ 

agtgaacc«tcagatcgcctggagacgccatccacgctgt« 

CCgggaccqatCCagcctccgcggccccgaattcctgcagcccATGCACTTGCAAAGGGC 

21 + + + + - 

TCTGGTAGTCC'IGGCCCTGCTGAACTTGGCCACAATCAGCCTCTCTCTGTCCACTTGCAC ^ ^ ^ 

81 + -+ + *" 

CACGTTGGftCTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCC^ 

GAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCATCGGTGATGACCCACGTCCCCTATCA 

i f 1 + 

GGTCCTGGCACTTTAr^CAGCACCCGGGAGTTGCTGGAAGAGATGCACGGGGAGAG^ 

^. +- . + + 

GGAAGGCTGCACTCAGGAGACCTCGGAGTCTGASTACTATGCCAAAGAGATCCATAAATT 

721 + + ^ * 

CGAO^TGATCCAGGGACTGGCGGAGCACAATGAACTGGCCGTCTGCCCCAAASGAAT^ ^ ^ ^ 

781 ♦ ■*■ 

CTCTAAGGTTTTTCGTTTCAATGTGTCCTCAGTGGAGAAAAATGGAACCAATCT 

a4i * ' — *— 

GCCAGAGTTCCCGGTCTTGCOGGTGCCCAACCCCAGCTCCAAGCeCACAGAGCAG^^ 

901 * ♦ * 

TGAGCTCTTCCAGATACTTCGACCGGATGAGCACATAGCCAAGCAGCGCTACATAGGTGG ^^^^ 

961 + * + * 

CAAGAATCIGCCCACAAGGGOCACCGCTGAATGGCTGTCTTTCGATGTC^^^ ^^^^ 

1021 ' + 

GCGCGAGTGGCT0TTGAG<5AGAGAGTCCAACTTGGGTCTGGAAATCAGCA'PCCACTGTCC ^^^^ 

1081 + + 

ATGTCACACCTTTCAGCCCAATGGAGACATACTGGAAAATGrTCATGAGGTGM ^^^^ 

1141 + + ^ 

CAAATTCAAAGGAGTGGACAATGAftGATGACCATGGCCGTGGAGACCTGGGGCGTC^^ ^^^^ 

+ + * 

GAAGCAAAAGGATCACCACMCCCACACCTGATCCTa^TGATGATCCCCCC^^ ^^^^ 
+ + — ---1- 
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1261 
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GCACACCCCACGCCAGGGCAGTCAGAGGAAGAAGAGGGCCCTGGACACCAATTACTGCTT 
1321 + + + — + 1380 

CCGCAACCTGGAGGAGAACTGCTGTGTACGCCCCCTTTATATTGACTTCCGGCAGGATCT 
1381 -+ + + + + 1440 

AGGCTGGAAATGGGTCCACGAACCTAAGGGTTACTATGCCAACTTCrGCTCAGGCCCTTG 
1441 + + + + + 1500 

CCCATACCTCCCCAGCGCAGACACAACCCATAGCACGGTGCTTGGACTATACAACACCCT 
1501 ^ + ♦ + + 1S60 

GAACCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCCAGGACCTGGAGCCCCTGACCAT 
xS61 + 1^ + + + + 1620 

CTTGTACTATGTGGGCAGAACCCCCAAGGTGGAGCAGCTGTCCAACATGGTGGTGAAGTC 
1621 + + * + 1680 

GTGTAAGTGCAGCTGAgggggatccactagttctagaggacccagacatgataagataca 
168] + + + + + 1740 

ttgatgagtttqgaoaaaccacaaccagaatgcagtgaaaaaaatigctttatttgtgaaa 
17<31 — -+ + " + ■ + + + 1800 

tttgtgatgctattgct ttdtttgtaaccattstaagctqcaataaacaagttaacadca 
1801 ^- + + 4- + + 1B60 

acaattgcactcattttatgtttcaggttcagggggaggtgtgggaggttttttaaagM 
1861 + 4 + + + + 1920 

agtaaaacc:tctacaaatgtggcatggctgatt:aT:gei;cctgcaagcctcgtcgtctggc 
1921 + + + -i- + + 1980 

cggaccacqctatctgtgcaaggtccccggacgcgcgctccatgagcagagcgcccgccg 
1981 + + + + + + 2040 

ccgaggcaagactcgggcggcgccctgcccgtcccaccaggtcaacaggcggteaccggc 
2041 + + + + + + 2100 

ctcttcatcgggaatgcgcgcgaccttcagcatcgccggcatgtcccctggcggacggga 
2101 — -+ + "+ ' + + + 2160 

agtatcagctcgaccaagcttggcgagattttcaggagctaaggaagctaaaatggagaa 
2iei + + + + + + 2220 

aaaaaccdctggatataccaccgttgatatatcccaatggcatcqtaaagaacattttga 
2221 + + + ^ ' + 2290 

gqcat ttcagr cagrtgctcaargtacctataaccagaccgttcagctgcattaatgaat 
2281 — ^ + + "+ 2340 

cggccaacgcgcgqqgagoggcggtttgcgcattgggcigctcttccgct Ccctcgctcac 
2341 ^ + + + ^ + 2400 

tgactcgcrgcgctcggccgttcqgctgcggcgagcggcatcagctcactcaaagtcggc 
2401 + • + ' + . — + ^ + 2460 

aatacggrtatcceicagaatcaggggataacgcaggaadgaacatgtgagcaaaaggcca 
2461 + + + — + ■ + 2520 

gcdaaagqccttggaaccgtaaaaaggccgcgtt-gctggcgtttttccataggctccgccc 
2b21 + ' 1- ^ + ^ 2580 

ccctgacgagcat cacaaaaaccgacgctcaagtcagaggtggcgaaacccgacaggact 
'^S^X + • + *1— + 4- +- + 2640 
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262 
2941 
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ataaagaiaccaggcgtttccccctggaagctccctcgtgcgct^^ ^^^^ 

2641 ^ 

gccgcttaccgqaLacctgtccgcctttctcccttcgggaagcgtgqcgctttctc^^ 

2701 + + - + — -f ^ 

ctcacgctgtaggtaLCtcagttcggtgtagcjtcgttcgctccaagctgggctg ^^^^ 

;?7M + ' * ' ^ 

ccjaacccr-ccgttcagcccgaccgctgcgcctt^ ^^^^ 

1 •< — ---- + ~ 

cccggtaagacacqauttatcgccactggaagcagccactggtaacaggattaacaga^ ^^^^ 
gaggtatgtagQcggtgctacagagttcttgaagtggtggcctaactacggctacac^^ ^^^^ 

aaggacagtatttggcatctgcgctctgctgaagccagfctaccttcggaaaaagagttg^ ^^^^ 

3001 — + + ' * 

Lagctcttgatccygcaaacaaaccaccgctggtagcggtggtttttttgttcgcaagca 

3061 * + * 

gcagattacgcqcagaaasaaaggatctcaagaagatcctttgatctttectacggggt^ ^^^^ 
3121 ♦ + * 

Lgacgctcagtggaacqaaaactcacgrtaagggattttggtcatgagattatcaaaaag ^^^^ 

3181 + ' ^ 

gatcttcacctagatccttttaaattaaaaatgaagtttraaatcaatctaaagtarata ^^^^ 

3241 + + *" """^ 

tgagtaaacttqqtotgacagttaccaatgcttaatcagtgaggcacctacctcagcgat ^^^^ 

3301 + + * * 

CLgtctatttcqttcacccatagttgcctgactccccgtcgtgtagataactacgatacg ^^^^ 

.^;^6l + "* 

ggagggcttaccatcT.ggccccagtgctgcaatgataccgcgagacccacgctcaccggc ^^^^ 

3421 + + ■♦■ ~ * 

tccagattcatcagcaataaaccagccagccggaagqgccgagcgcagaagtggtcctgc 

3481 ' + + *~" 

aactttatccgcctccatccagtctattaattgttgccgggaagctagagtaagtagttc 

3541 ' + * 

gccagttaacagtttgcgcaacgctgtcgccattgctacaggcatcgtgtggtcacgc^^ ^^^^ 

3601 ♦ + * "*■ 

gr.cgi:ttqqtat9gf:ttcattcagctccggttcccaacqatcaaggcgagttacatgatc ^^^^ 

3661 * *~~ 

ccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatc^^^ ^^^^ 

gtcggccgcagLgctatcactcatggttatggcagcaccgcataattctcttactgtcat. ^^^^ 
+ + + — 

gccatccgtaagatgctttcctgtgactggtgagtaccc*accaagtcatrctgaga^^ ^^^^ 



3721 
376 
.1H41 



gtgtatgcggc:gaccgaqttgctctrgcccgtcqt«aatacgggataataccgcgccaca ^^^^ 

3901 + + * 

tagcag^^etrtaaaagtgcLcatcattggaaaacgttcttcggggcgaaaactctcaag 
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3961 + + ^. 

+ ^ ^ 

+ , _^ ^ ^ ^^^^ 

+ ^ ^ ^ ^^^^ 

^ ^ ^^^^ 

4 201 --!-l:?f!?^!!!"!"!??!!!'^'^^'=^!"9«g'=5gatacatatttgaatgtatti:a 

+ _^ ^ ^ ^^^^ 

" — * + + 4 320 

4321 !?!!~"-!!-!!l!!!?:!!!!!!"^^^«^^«^taggcgtatcacgaggccctttcg 

+ + ^ ^ ^^^^ 

tc 

43Si -- 4382 
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